RECOMBINANT MULTIVALENT M PROTEIN VACCINE
FIELD OF THE INVENTION 



The invention relates to recombinant multivalent M protein vaccines useful to control
group A streptococcal infections of different serotypes which may result in rheumatic fever or
rheumatic heart disease, and to other embodiments further described herein.

BACKGROUND OF THE INVENTION 

Acute rheumatic fever (ARF) is the major cause of heart disease in children around 
the world. The disease is rampant in developing countries where prevalence rates of rheumatic heart 
disease may be as high as 35-40 per thousand individuals. By one estimate, it affects nearly six million
school-age children in India. Although the incidence of ARF in the United States and other Western
countries declined markedly during the latter half of the twentieth century, there has been a recent
remarkable resurgence of the disease in the United States. Hence, the need for a safe and effective 
vaccine is urgent and serious. 

Streptococci are a group of bacteria with the capacity to grow in chains. Many 
varieties are part of the normal bacterial flora in humans and are not especially harmful. However, 
a particular subgroup of streptococcal bacteria, called group A and represented by Streptococcus 
pyogenes, is a human pathogen. Between 20 and 30 million cases of group A streptococcal infections
occur every year in the United States alone. These cases include infections of the skin and throat, 
forms of pneumonia and a more recently identified disease resembling toxic shock. The most 
common infection is acute streptococcal pharyngitis, or strep throat, which occurs predominantly in 
school-age children. Strep throat qualifies as a major worldwide health problem if judged only by 
time lost from school and work and by the amount spent on related doctor's fees. 

Strep throat's toll is much greater, however. In as many as 4% of the pharyngitis cases 
that are untreated or treated ineffectively, the strep infection leads to ARF. Current attempts to
prevent ARF rely on treatment of the pharyngitis with antibiotics. During a recent outbreak of ARF
in Utah, only a fourth of the patients sought health care prior to the onset of symptoms, and only a
third recalled a recent sore throat. The finding that ARF may follow a subclinical infection in such
a high percentage of individuals and the fact that access to health care in developing countries is not
widely available serve to underscore the need for a safe and effective vaccine against group A 
streptococci. 

The causal relationship between streptococcal pharyngitis and ARF was established 
over 50 years ago, yet the mechanism of the pathogenesis of the disease remains unclear. It is widely
held that ARF is an autoimmune disease, and that in the susceptible host the infection triggers an
immune response that leads to inflammatory and sometimes destructive changes in target tissues.
Streptococci have been shown to contain antigens that are immunologically cross-reactive with host
tissues, and heart-cross-reactive antibodies from patients with rheumatic fever have been shown to
react with streptococci. However, it was also shown that sera from patients with uncomplicated
pharyngitis may contain heart-cross-reactive antibodies, yet these patients do not develop clinical
evidence of carditis. Until the significance of tissue-cross-reactive antibodies in the pathogenesis of
ARF is better understood, there remains a need to exclude potentially harmful epitopes from vaccine 
preparations. 

The surface M protein of group A streptococci is the major virulence factor and 
protective antigen of these organisms. Group A streptococci have developed a system for avoiding
some of the antimicrobial defenses of a human host. Strains of streptococci that are rich in M protein 
evade phagocytosis by PMNs and multiply in non-immune blood. Yet, resistance to an infection by 
these bacteria is possible if the host's body can produce opsonic antibodies directed against the M 
protein. Such antibodies will neutralize the protective capacity of the M protein and allow the 
streptococcus to be engulfed and destroyed by phagocytes. Secretory or mucosal antibodies, as
opposed to serum opsonic antibodies, are also now suspected of playing an important
role in preventing streptococcal infections.

A major obstacle to effective vaccine development has been the very large number of
M protein serotypes. See Stollerman, Rheumatic Fever and Streptococcal Infection, Grune &
Stratton (1975). These are reported to number about 82 to date and more can be expected to be
identified.

It has been shown that antibodies against one serotype do not necessarily offer
protection against others, although some do cross-react with others. Immunity then appears to be
type- or sero-specific, and optimal vaccines would require that most of the serotypes be represented.
The concept of "rheumatogenic" serotypes is supported by
studies over many years and in diverse areas of the world. Thus, there are probably about 12-15
serotypes responsible for most cases of ARF. Some of these are types 1, 3, 5, 6, 14, 18, 19, 24, 27 and
29.

To assist in a better understanding of the invention, a description of the M protein 
structure is useful. See Scientific American, June 1991, "Streptococcal M Protein" by Vincent A.
Fischetti. Considering a typical M protein structure such as that of type M6, approximately 80 
percent of the M6 molecule is made of four distinct regions, each of which consists of repeated 
sequences of amino acids. These regions are arbitrarily designated by the letters A through D. Near 
the N-terminal, or amino end, the part of the molecule farthest from the bacterial cell, lies region A.
This region has five tandem repeats, or blocks, of 14 amino acids each. The three central repeats are
identical, whereas the repeats at each end of the region diverge slightly from the common amino acid
sequence. Next on the molecule is region B, which has a similar five-repeat structure except that the
repeated blocks contain 25 amino acids. Region C consists of two and a half tandem repeats of 42
amino acids each.
Region D is composed of four partial repeats containing seven amino acids. The section buried in the
cell extends from about the last repeat of the C region to the C-terminus. 
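
As a rough illustration of the repeat architecture just described, the following minimal sketch tallies the residues contributed by each repeat region of the type 6 M protein. The block counts and block lengths are those quoted in the text; the names `REPEAT_REGIONS` and `region_lengths` are illustrative only, and treating the partial D repeats as full seven-residue blocks is a simplifying assumption.

```python
# Repeat regions of the type 6 M protein as described in the text:
# region -> (number of tandem repeats, amino acids per repeat).
REPEAT_REGIONS = {
    "A": (5, 14),
    "B": (5, 25),
    "C": (2.5, 42),
    "D": (4, 7),  # assumption: partial repeats counted as full 7-residue blocks
}


def region_lengths(regions):
    """Return the approximate number of residues in each repeat region."""
    return {name: count * size for name, (count, size) in regions.items()}


lengths = region_lengths(REPEAT_REGIONS)
total = sum(lengths.values())
for name, residues in lengths.items():
    print(f"region {name}: ~{residues:g} residues")
print(f"all repeat regions: ~{total:g} residues")
```

The total is only an estimate of the repeat-containing portion of the molecule, since the end repeats of each region diverge and the D repeats are partial.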

Adjacent to the D-repeat blocks is a non-repeat region containing an abundance of
proline and glycine amino acids, which are distributed in a nearly regular pattern. Beyond that region
lies the C-terminal, or carboxyl, end of the molecule. Near the C-terminal end are 20 hydrophobic
amino acids and, at the very terminus, charged amino acids.

WO 94/06421
PCT/US93/08703

Similar arrangements of repeat blocks are seen in type 5 and other streptococcal M proteins. An
alignment of the sequences of these M proteins shows that their C-terminal ends are more than 98
percent identical. Closer to the N-terminus, however, differences in sequence among M proteins
increase. Consequently, the A-repeat blocks and a short amino acid region of about 10 to about 20
amino acids at the N-terminus are unique for each M molecule. This uniqueness is the major
determinant of the sero-specificity of the immunological response.

In the amino acid sequence of M6, and later discovered in other M proteins, another
intriguing structural detail revealed itself. Running through the repeat regions is a
seven-amino acid pattern in which the first and the fourth amino acids of each group of seven are
hydrophobic. The seven-residue pattern in the arrangement of the amino acids in M6 indicates that the
molecule forms an alpha-helical coiled coil. The pattern is not perfectly regular in M6,
nor is that pattern found to be perfect in many other coiled-coil proteins; the irregularities may be
responsible for the flexibility of the M molecules observed in electron micrographs. More importantly,
the characteristics of these irregularities differ in the A-, B- and C-repeat regions. This observation
suggests that each repeat region evolved independently and may have a distinct function. For an
illustration of the protein sequence of M6 determined by cloning its gene, and for different forms of
related M proteins when mutant streptococci delete copies of the amino acid repeats found in the
parental molecule, especially at the N-terminus, see Scientific American, cited above. Studies have
shown that each M protein fiber on a streptococcal cell wall is about 50 to 60 billionths of a meter
long and consists of a single coiled-coil dimer (two M proteins coiled around each other).

It is likely that M proteins of all serotypes are built along a basic theme: they have a
lengthy coiled-coil rod region in their centers that is flanked by a floppy section at the N-terminal
end and an anchoring region at the C-terminal end. Because the alpha-helical coiled-coil structure
can accommodate a large number of varying amino acid sequences, many different M proteins with 
the same general conformation can be constructed, as is shown hereinafter. 
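
The seven-residue periodicity of a coiled coil can be illustrated with a small scoring routine. In alpha-helical coiled coils generally, hydrophobic residues tend to occupy the first and fourth positions of each seven-residue repeat. The sketch below is not taken from the cited work: the residue set `HYDROPHOBIC`, the function names and the test peptide are all assumptions chosen for illustration, and the peptide is an idealized sequence, not an M protein sequence.

```python
HYDROPHOBIC = set("AILMFVWY")  # assumption: a simple hydrophobic residue set


def heptad_score(sequence, frame=0):
    """Fraction of heptad positions 1 and 4 that are hydrophobic.

    `frame` is the index in `sequence` at which the first heptad starts.
    """
    core = [
        residue
        for i, residue in enumerate(sequence[frame:])
        if i % 7 in (0, 3)  # positions 1 and 4 of each heptad
    ]
    if not core:
        return 0.0
    return sum(r in HYDROPHOBIC for r in core) / len(core)


def best_frame(sequence):
    """Return the frame (0 to 6) with the highest heptad score."""
    return max(range(7), key=lambda f: heptad_score(sequence, f))


# Hypothetical idealized peptide: three heptads with leucine at positions 1 and 4.
peptide = "LKELEEK" * 3
print(heptad_score(peptide, 0), best_frame(peptide))
```

Because many different residues can fill the remaining five positions, a scan of this kind tolerates widely varying sequences, which is consistent with the point that many different M proteins can share the same general conformation.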

For an M protein to protect a streptococcus, it must be able to attach to the organism. 
The mechanism that holds surface proteins on gram-positive bacteria is still poorly understood, but 
various studies of the M protein have been enlightening in that respect. 

It is believed that the 20 hydrophobic amino acids near the C-terminal end are
positioned in the similarly hydrophobic membrane itself, whereas the charged amino acids at the
very terminus protrude into the aqueous cytoplasm. Because the charged amino acids would resist
moving into a hydrophobic environment, they would act like a knot at the end of a string, preventing 
the M molecule from being pulled through the membrane. That mechanism may be valuable for some 
proteins attached to membranes. More recent evidence indicates, however, that the attachment 
mechanism for M protein and other bacterial surface proteins may actually be more sophisticated. 
Studies have revealed that all surface proteins from gram-positive bacteria have a similar arrangement 
of hydrophobic and charged amino acids at their C-terminal end. See for instance Fischetti et al.,
Surface Proteins from Gram-Positive Cocci Share Unique Structural Features, New Perspectives on
Streptococci and Streptococcal Infections (G. Orefici, Editor), Gustav and Jena (Publishers) 1992.

More important, however, a short six-amino acid sequence adjacent to the 
hydrophobic region is highly conserved in all the known surface proteins of gram-positive bacteria.
The sequence consists of a leucine, a proline, a serine, a threonine, a glycine and a glutamic acid. Its 
designation is usually abbreviated as LPSTGE. 

The importance of the LPSTGE sequence in the attachment of the M protein (and 
probably in all other proteins with this sequence motif) was shown by reported genetic experiments. 
It was found that if only the LPSTGE sequence is removed from the M protein gene, the M molecule 
that was produced would not attach to the bacterial membrane. This result suggested that the
hydrophobic domain and the charged amino acids at the C-terminus are not sufficient for membrane
attachment and that the LPSTGE motif may be an important signal for initiating the process. 
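
The deletion experiment described above can be mimicked on a sequence string. The following is a minimal sketch only: the C-terminal stretch `c_terminus` is invented for illustration (the motif, followed by 20 hydrophobic residues and a short charged tail) and is not a real M protein sequence, and the function names are hypothetical.

```python
MOTIF = "LPSTGE"  # leucine, proline, serine, threonine, glycine, glutamic acid


def find_anchor_motif(protein, motif=MOTIF):
    """Return the 0-based index of the last occurrence of the motif, or -1."""
    return protein.rfind(motif)


def delete_motif(protein, motif=MOTIF):
    """Return the protein with the last occurrence of the motif removed."""
    start = find_anchor_motif(protein, motif)
    if start < 0:
        return protein
    return protein[:start] + protein[start + len(motif):]


# Hypothetical C-terminal stretch, invented for illustration only.
c_terminus = "AQA" + MOTIF + "LAVIAALLVVAILLAIVALL" + "KRKEEN"

mutant = delete_motif(c_terminus)
print(find_anchor_motif(c_terminus), find_anchor_motif(mutant))
```

In the mutant string the hydrophobic stretch and the charged tail are still present, which mirrors the observation that those two features alone were not sufficient for attachment.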

In nearly all surface proteins found in gram-positive bacteria, there is another 
distinctive region that spans about 50 to 75 amino acids on the N-terminal side of the hydrophobic
region. This part is probably located within the peptidoglycan. Proline, glycine, threonine and serine
constitute a high percentage of these amino acids. The reason for their prevalence has not been fully
explored, but it is thought that prolines and glycines can create turns and bends in proteins. One 
hypothesis holds that cross-links in the peptidoglycan can weave through the proline- and glycine- 
induced bends, thereby stabilizing the M protein's position in the cell wall. 

The knowledge that all known surface proteins on gram-positive bacteria attach
themselves by a similar mechanism may open new avenues, such as controlling infections caused by
these organisms. Surface proteins help pathogenic organisms initiate infections. It has been proposed
that by preventing the proteins from anchoring to the bacterial cell, one should eventually be able to 
block infections and circumvent some of the problems associated with resistance to antibiotic 
therapies. 

Just as the structures at the C-terminal end of the molecule provide information on 
how the M protein attaches to the bacterial cell, structures at the N-terminal end offer clues about 
how the molecule helps to fend off phagocytes. The N-terminal end of all M molecules has an excess
of negatively charged amino acids, which results in a net negative charge for the region. Mammalian 
cells also exhibit a net negative charge on their surface. It has been suggested that the charge on M 
proteins may thus have evolved to hamper contact between streptococci and phagocytic cells through 
electrostatic repulsion. It has been proposed that one function of the central rod in the M protein is 
to act as a shaft for holding the negatively charged N-terminal end - and phagocytes - away from the 
bacterial surface. 
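
The notion of a net negative charge can be approximated by counting charged side chains. The sketch below assumes a deliberately crude model (aspartic and glutamic acid count as -1, lysine and arginine as +1, and histidine and the chain termini are ignored); the peptide `n_terminal` is hypothetical and does not come from any M protein.

```python
ACIDIC = set("DE")  # aspartic acid, glutamic acid: negative at neutral pH
BASIC = set("KR")   # lysine, arginine: positive at neutral pH


def net_charge(peptide):
    """Approximate net charge: basic residues minus acidic residues."""
    positive = sum(residue in BASIC for residue in peptide)
    negative = sum(residue in ACIDIC for residue in peptide)
    return positive - negative


# Hypothetical N-terminal peptide, invented for illustration only.
n_terminal = "AVDEETLKDSEQAEDK"
print(net_charge(n_terminal))
```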

At the N-terminal end of the coiled-coil rod, there is also a hypervariable region. This 
part of the molecule has a distinctive sequence in each M serotype. The hypervariable region consists
of the short 10- to 30-amino acid non-helical sequence and, if present, the adjoining A-repeat region.
The hypervariable region plays an important role in the biological activity of the molecule; antibodies
against this area are optimal at promoting phagocytosis and killing of the streptococci. This
observation again explains why only serotype-specific antibodies protect against strep infections.

One hallmark of rheumatic fever is the presence of antibodies that react with muscle 
tissue, particularly heart tissue, in a patient's serum. See "Rheumatic Fever" by Earl H. Freimer and 
Maclyn McCarty; Scientific American, December 1965. Normally, antibodies are not made against 
one's own tissues. Researchers have discovered, however, that so-called cross-reacting antibodies can
sometimes be induced by a molecule in an infective organism that resembles one in the mammalian 
host. In the process of making antibodies against the microbial molecules to clear an 
infection, the body is tricked into generating antibodies against its own tissues (serological cross- 
reactivity), a potentially harmful development. 

It is evident from this description that there is an important and urgent need for a 
vaccine which is effective against the various serotypes of group A streptococci. The vaccine should 
be capable of raising sero-specific antibodies, especially against those serotypes capable of triggering
acute rheumatic fever, without eliciting cross-reaction with human tissue. There is also an important
need for a vaccine which has not only these properties but also is capable of raising protective
antibodies against infections such as sore throat, skin infections, deep tissue infections and the like
that are not necessarily but frequently are followed by rheumatic fever. The invention contributes to solving these important
needs in human health. 

Thus, there is an important need for a vaccine effective against streptococcal infections
which provides humoral immunity against the diverse serotypes of group A streptococci and, when
desired, also cellular immune responses. The vaccine should not elicit antibodies which react with
human heart tissue. 

In conjunction with studies of the M protein of various serotypes, it has been found
that in most cases the protective epitopes of M protein may be separated from the potentially harmful,
autoimmune epitopes of the molecule (see Refs. 5-7). The NH2-terminal segments of M proteins have
been found to evoke antibodies with the greatest bactericidal activity. 

Further studies have shown that synthetic peptides copying limited regions of types 
5, 6 and 24 M proteins evoked type-specific, opsonic antibodies that were not heart tissue
cross-reactive. Because of their lack of immunogenicity, however, it was necessary to chemically link the
synthetic peptides covalently to carrier proteins (see Refs. 5-7). Such fragments of M proteins linked
to carrier proteins with chemical reagents do not result in hybrid proteins of defined structures.
Thus, it has not been possible to obtain antigens which can elicit specific, desired
antibodies without increasing the risk of undesirable side reactions. Further, formation
of hapten-carrier complexes using chemical cross-linking reagents is time-consuming and costly and
results in undefined heterogeneous mixtures of vaccine components. This invention provides
multivalent vaccines that are type-specific and do not have the drawbacks of the prior art.

SUMMARY OF THE INVENTION

This patent application is related to and is co-filed on the same day as patent
application No. ________ entitled "ANTIGEN OF HYBRID M PROTEIN AND
CARRIER FOR GROUP A STREPTOCOCCAL VACCINE" with named inventor James B. Dale
(attorney's docket no. 372.5770P).

The invention relates to a recombinant multivalent hybrid M protein vaccine against 
multiple serotypes of group A streptococci. The vaccine comprises a protein molecule which contains 
epitopes (antigenic determinants) that evoke opsonic antibodies against multiple serotypes of group 
A streptococci. The molecule contains one amino acid fragment which contains at least one epitope 
which evokes humoral, opsonic antibodies of a particular serotype. The respective fragments are
optionally linked in tandem by linkers which include amino acids. The M protein is free of epitopes
that raise antibodies cross-reactive with human heart tissue. The hybrid M protein is made by
recombinant DNA technology. 

The multivalent hybrid M protein contains amino acid fragments which are amino-terminal
fragments of the M protein and hence are capable of eliciting the desired opsonic antibodies
against the multiple serotypes of group A streptococci. The invention also relates to multivalent 
hybrid M proteins which, in addition to amino acid fragments which evoke opsonic antibodies against 
specific serotypes of group A streptococcus, also carry amino acid fragments with epitopes which also 
evoke protective, cellular or mucosal antibodies. 

In the hybrid M protein of the invention, the amino acid fragments need not all be 
immunogenic as such, yet when co-expressed with other amino acid fragments, the hybrid molecule 
is immunogenic against more than one serotype of group A streptococci.

Of particular interest are recombinant multivalent hybrid immunogenic M proteins 
which comprise amino terminal fragments of the M protein which contain epitopes that elicit
antibodies against serotypes which are causative of rheumatic fever, such as serotypes 1, 3, 5, 6, 14,
18, 19, 24, 27 and 29 or others. Ideally, the hybrid M protein is constituted of a multiplicity of amino acid fragments
preselected to elicit antibodies against the particular target serotypes of group A streptococci. 

The invention also relates to a vaccine of the M protein type which is multivalent in
general, such as tetra-, penta- or hexavalent, against the corresponding number of serotypes of group
A streptococci. Thus, the invention relates to hybrid proteins which comprise different amino acid
fragments of the C-repeats of an M protein of different serotypes which cause cellular immune
responses, particularly those which are not tissue cross-reactive. Such hybrid proteins can also
include fragments of the amino-terminal of the various serotypes to give a hybrid composite which
elicits a humoral immune response and one or more cellular immune responses. The multivalent
vaccines may contain one or more than one copy, i.e. repeats, of any particular amino terminal fragment
of a different serotype.

Further, the fragments need not, but may, be of the same amino acid length. Further,
the invention contemplates a mixture or a "cocktail" of hybrid M proteins which carry appropriate
epitopes to elicit the desired opsonic antibodies against the target serotypes. The mixture of these 
hybrids will be effective against a large number, if not all, of the target streptococcal types, 
particularly those which are causative of rheumatic fever. 

The invention also relates to mixtures of individual hybrid M protein molecules which 
are capable not only of eliciting opsonic antibodies against multiple serotypes of group A streptococci
but also of eliciting mucosal antibodies. In this manner, the invention provides, as desired, a complete
vaccine not only against those serotypes more likely to initiate or cause rheumatic fever, but also those 
which cause infections described above, which often can lead to rheumatic fever. 

In accordance with the invention, the amino acid fragments which carry the desired 
protective epitopes (opsonic or mucosal) may be (but need not be) fused to each other in tandem by 
linkers which comprise amino acids. A great variety of such amino acid linkers can be used in
accordance with the invention. It is desirable that the amino acids contribute to the orientation,
conformation and in effect to the immunoaccessibility of the epitopes of the fragments so as to
generate an optimum immune response. It is not excluded that these amino acid linkers contain one 
or more molecules which are not amino acids. 

In accordance with the invention, the order, i.e., the sequence of the amino acid 
fragments that constitute the hybrid M proteins which carry the protective epitopes can be changed
as desired to maximize the immunogenicity of the molecule. 
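
The freedom to vary both the linkers and the order of the fragments can be pictured with a short sketch. Everything in it is a placeholder: the fragment strings are not real M protein sequences, the two-residue linker `GS` is an arbitrary example, and `build_hybrid` is a hypothetical helper rather than any method of the invention.

```python
from itertools import permutations

# Placeholder fragments, invented for illustration; not real M protein sequences.
FRAGMENTS = {
    "M24": "AAAAAA",
    "M5": "CCCCCC",
    "M6": "DDDDDD",
    "M19": "EEEEEE",
}


def build_hybrid(fragments, order, linker=""):
    """Join the named fragments in the given order, separated by `linker`."""
    return linker.join(fragments[name] for name in order)


# One order with a linker between fragments, and the reverse order without linkers.
forward = build_hybrid(FRAGMENTS, ["M24", "M5", "M6", "M19"], linker="GS")
reverse = build_hybrid(FRAGMENTS, ["M19", "M6", "M5", "M24"])

# Every possible ordering of the four fragments.
all_orders = list(permutations(FRAGMENTS))
print(forward, reverse, len(all_orders))
```

With four fragments there are 24 possible orders, so in practice the candidate arrangements would have to be narrowed by experiment.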

As referred to above, the invention relates to hybrid M proteins which include amino 
acids of the amino termini of the M proteins, i.e. M24, M5, M6 and M19 and amino acids of the 
carboxyl-terminus of type 5 M protein. This vaccine raises type-specific opsonic antibodies against
all of the related M fractions, cross-protective mucosal immune responses against two or more of
these and cellular immunity.

The invention also relates to a particular PCR method which makes it possible to arrange the
coding nucleotide sequences to express the desired amino acid fragments (or sequences) in the order
desired.

The invention also relates to a method of immunization with the therapeutic 
recombinant multivalent hybrid M proteins of the invention or with a composition which comprises 
the recombinant hybrid multivalent M protein and an appropriate biochemically or pharmaceutically
acceptable carrier. The immunogenic hybrid M proteins of the invention may be formulated with
the biochemically or pharmaceutically acceptable carrier to produce a vaccine which elicits an
effective level of the desired antibody in the subject mammal, including human beings, to provide 
the desired immunity, i.e. humoral or humoral and cellular. 

Further, the invention relates to avirulent microorganisms transformed (or transfected)
with recombinant multivalent hybrid M protein genes (or portions thereof) of the invention. The
microorganism can be avirulent as such or may have been rendered non-virulent by methods known
in the art. The avirulent host bacterium is unable to colonize in the subject to be immunized 
generally by virtue of a nutritional deficiency; nonetheless the bacterium will multiply just to a 
limited extent to release the M protein antigen and elicit the appropriate antibodies. Such 
compositions are very well suited for oral administration. 

The invention also relates to the hybrid genes which code for and express the desired 
hybrid M proteins in an appropriate self-replicating vehicle.

The invention further provides for modification of the amino acid sequences 
constituting the hybrid antigenic molecule by chemical methods if it is desired to add to and/or replace
any one of the amino acids by another molecule to increase and/or modify the immunogenicity of the
hybrid molecule. 

This summary of the invention is not intended to summarize all the embodiments (or 
aspects) of the invention.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows the recombinant DNA nucleotide sequence and deduced amino acid
sequence of trivalent M24-M5-M6.

FIG. 2 shows the immunoblot analysis of purified M24-M5-M6.
FIG. 3 shows ELISA inhibition assays. 

FIG. 4 shows the recombinant DNA nucleotide sequence and deduced amino acid
sequence of the tetravalent M24-M5-M6-M19 hybrid molecule.

FIG. 5 shows the immunoblot analysis of M24-M5-M6-M19 hybrid vaccine. 

FIG. 6 shows the DNA recombinant nucleotide sequence and deduced amino acid 
sequence of tetravalent hybrid M24-M5-M6-M19 with different linkers. 

FIG. 7 shows the DNA recombinant nucleotide sequence and deduced amino acid 
sequence of tetravalent hybrid M24-M5-M6-M19 constituted of repeats of smaller fragments of each
of the different serotypes with these fragments being directly linked by their respective amino and 
carboxy ends to the adjoining fragment at the indicated restriction sites. 

FIG. 8 shows the DNA recombinant nucleotide sequence of tetravalent hybrid
M24-M5-M6-M19-C-terminus of 915 nucleotides encoding the 305 amino acid long hybrid, having the
COOH terminal half of M5 joined at restriction site PstI.

FIG. 9 shows the DNA recombinant nucleotide sequence and deduced amino acid 
sequence of the tetravalent hybrid M19-M6-M5-M24 where the sub-units are in the reverse order
of that in the construct shown in FIG. 4. No linkers join the fragments of the nucleotide sequence
or of the amino acid sequence.

FIG. 10 shows the DNA recombinant nucleotide sequence of a divalent M24-M5
hybrid.

FIG. 11 shows the DNA recombinant nucleotide sequence of tetravalent
M19-M6-M5-M24, 1029 nucleotides long, linked to C-repeats of the carboxy terminal.

FIG. 12 shows the DNA recombinant nucleotide sequence of tetravalent
M24-M5-M6-M19 with short 15 amino acid units linked directly to each other.

FIG. 13 shows the DNA recombinant nucleotide sequence of octavalent hybrid protein 
M24-M5-M6-M19-M4-M1-M18-M12 which contains a non-rheumatogenic serotype of streptococci 
(M12). 

Suitable vectors for cloning the selected DNA fragments of the M protein are available 
commercially. See the Bibliography provided herewith. Expression of the hybrid protein is carried 
out by suitable prokaryotes, like E. coli, or, if desired, eukaryotes, like yeast.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Various embodiments of the invention are more fully described hereinafter. 

The construction of a trivalent and of a tetravalent hybrid antigen is described in 
greater detail, but it is understood that similar protocols are applicable to construct the other hybrid 
antigens described herein. 

The method of the invention for constructing the multivalent hybrid constructs 
involves in vitro recombinant DNA technology. The method may be described in a general manner
as follows. A polyvalent hybrid gene is constructed using a selected native fragment of the desired
length of a desired M gene, such as that of M24, M1, M5, etc. (designated emm24, emm1 and
emm5, respectively). The fragment is free of nucleotides that encode an amino acid sequence that
can cause tissue cross-reactivity. The DNA sequences encoding amino acid fragments free of epitopes
which cause autoimmune responses are identified as shown in the literature, for instance References
5, 6, 7 in the attached Bibliography. The fragment is amplified by polymerase chain reaction (PCR),
purified and ligated into a self-replicating vehicle, e.g., a plasmid, like pBR322. Oligonucleotide
pairs copying the desired codons of the other structural genes that make up the multivalent gene,
respectively, are synthesized to contain appropriate restriction sites to facilitate ligation to the first
selected gene, e.g. emm24. The entire polyvalent construct is excised from the vehicle (e.g., pBR322)
and ligated into an appropriate plasmid, e.g. pKK223-3, to allow high-level expression of the
recombinant hybrid protein.

The selected gene (e.g. emm24) is amplified by PCR using synthetic oligonucleotide
primers that specify amplification of the gene that encodes the desired region of the amino-terminal
portion, e.g., half of the molecule, e.g., the pep M24 region, following the procedures and materials
disclosed in Beachey et al., 1978, entitled "Repeating Covalent Structure of Streptococcal M Protein,"
Proc. Natl. Acad. Sci. USA, 75, 3163-3167, and Mouw et al., 1988, entitled "Molecular Evolution of
Streptococcal M Protein," J. Bacteriol. 170, 676. With respect to the structural gene of type 24 M
protein, the complete nucleotide sequence includes an open reading frame of 1617 base pairs encoding
a pre-M24 protein of 539 amino acids with a predicted M.W. of 58,738. The structural gene contains
two distinct tandemly reiterated elements. The first repeated element consists of 5.3 units, and the
second contains 2.7 units. Each element shows little variation of the basic 35-amino acid unit.
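
The figures quoted for the type 24 gene are mutually consistent, as a short check shows: three nucleotides per codon gives 1617 / 3 = 539 residues. The sketch below simply restates that arithmetic; the helper name `codons` is illustrative, and the mean residue mass it prints is a derived figure, not one stated in the cited work.

```python
def codons(nucleotides):
    """Number of complete codons in a coding region of the given length."""
    if nucleotides % 3:
        raise ValueError("length is not a multiple of three")
    return nucleotides // 3


orf_bp = 1617       # open reading frame of the type 24 M gene, from the text
residues = 539      # length of the pre-M24 protein, from the text
mol_weight = 58738  # predicted molecular weight, from the text

# Three nucleotides encode one amino acid.
assert codons(orf_bp) == residues

# Derived value: average mass per residue implied by the quoted figures.
print(f"mean residue mass: {mol_weight / residues:.1f} Da")
```

The same check applies to any of the constructs for which both a nucleotide length and an amino acid length are given.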

The (top and bottom PCR) primers are synthesized by an automated DNA synthesizer 
(ABI, Model 381A). When necessary, i.e. when the region to be amplified does not contain an
appropriate signal sequence and native start codon, the top strand primer is extended on the 5' end
by an appropriate start codon like ATG.

This is performed in the case of emm24. Appropriate restriction sites (e.g. EcoRI) are
also added and a GGG tail incorporated to ensure that the enzyme recognizes the cleavage site
in the purified, double stranded DNA product. The bottom strand primer is extended on the 5' end
by an appropriate restriction site (e.g., BamHI) and a CTC tail.
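
The primer layout described above can be sketched as simple string assembly. This is an illustration under stated assumptions, not the primers actually used: the order of elements (tail, then restriction site, then start codon, then gene-specific bases), the 18-base annealing length and the `target` sequence are all assumed for the example. The recognition sequences GAATTC (EcoRI) and GGATCC (BamHI) are the standard ones.

```python
ECORI = "GAATTC"  # EcoRI recognition site
BAMHI = "GGATCC"  # BamHI recognition site
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (5' to 3')."""
    return dna.translate(COMPLEMENT)[::-1]


def top_primer(target, n=18, add_start=True):
    """5' GGG tail + EcoRI site + optional ATG + first n bases of the target."""
    start = "ATG" if add_start else ""
    return "GGG" + ECORI + start + target[:n]


def bottom_primer(target, n=18):
    """5' CTC tail + BamHI site + reverse complement of the last n bases."""
    return "CTC" + BAMHI + reverse_complement(target[-n:])


# Hypothetical target region, invented for illustration only.
target = "GTCGCGACTAGGTCTCAGACAGATACTCTGGAAAAAGTACAAGAA"

print(top_primer(target))
print(bottom_primer(target))
```

A real design would also have to keep the reading frame intact across the added sites and confirm that neither site occurs inside the amplified region.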

PCR amplification is performed using chromosomal DNA extracted from the selected 
type streptococci (e.g. type 24), which is then used as template. See Dale et al., Type-Specific
Immunogenicity of a Chemically Synthesized Peptide Fragment of Type 5 Streptococcal M Protein,
J. Exp. Med., 158, 1727 (1983). The reaction mixtures comprise template DNA, primer pairs, dNTPs
and Taq DNA polymerase in PCR buffer. Amplification is performed in an appropriate automatic
thermal cycler with denaturation, annealing and polymerization steps performed by standard
methods. The PCR product is electrophoresed by standard methodology and then purified. The
purified product is cut with the appropriate restriction
enzymes, in this case EcoRI and BamHI, in appropriate buffers.

The purified PCR product (e.g. emm24) is ligated into an appropriate self-replicating vehicle,
such as pBR322, cut with the same restriction enzymes. The plasmid is then used
to transform E. coli strain MC1061 by standard methods. Transformants are screened for the presence
of plasmids containing inserts of the appropriate size on gels, and one plasmid is selected for
ligation of the other synthetic oligonucleotide pairs previously synthesized.

The selected oligonucleotides (e.g. emm5) are mixed in equimolar ratios and allowed to
re-anneal. The selected oligonucleotide pair (e.g. emm5) is then ligated into the plasmid, which is
used to transform an appropriate strain of E. coli such as MC1061. Plasmids containing appropriately
sized inserts are identified and one is selected for ligation of the additional oligonucleotide pairs (e.g.
emm6), which is accomplished in a similar manner. The resulting polyvalent plasmid, e.g.
pCDM24-M5-M6, is then purified and cut with restriction enzymes. The excised polyvalent hybrid gene
(e.g. emm24-5-6) is then ligated into a high-level expression vector (e.g. pKK223-3) that contains a
promoter and a ribosome binding site adjacent to the cloning site, e.g. EcoRI.

In a similar manner, other multivalent hybrid gene constructs are synthesized. For
instance, the M24-M5-M6-M19 tetravalent hybrid protein is constructed using fragments of the 5'
region of the emm genes amplified by PCR, purified, ligated in tandem and expressed in pKK223-3.

Sequence analysis of the polyvalent hybrid genes is carried out by sequencing the
appropriate inserts in an appropriate vehicle, such as pKK223-3, by the dideoxynucleotide chain
termination method of Sanger et al., DNA Sequencing with Chain-Terminating Inhibitors, Proc. Natl.
Acad. Sci. USA, 74, 5463 (1977).
The method of the invention includes the purification of the recombinant hybrid M
proteins. These are purified from extracts of E. coli: the cells are pelleted and treated by standard
purification methods, and the product is finally lyophilized.

The invention also includes a method for synthesizing by PCR a recombinant DNA
sequence which is constituted by the desired nucleotide segments in the particular order (sequence)
and of the particular size and orientation desired. Considering that the objective is to synthesize a
DNA sequence constituted of sequences A-B-C-D, the following procedure is followed. After cutting
PCR-generated fragments A and B with the desired restriction enzyme at the unique sites encoded by
the 5' end of the PCR primers, A and B are ligated to each other. The A-B fragment is amplified using
as primers the A top strand oligonucleotide and the B bottom strand oligonucleotide, thus amplifying
only the A-B ligation product. This product is ligated to fragment C. The A-B-C fragment is in turn
PCR amplified using the A top strand oligonucleotide and the C bottom strand oligonucleotide
as primers, thus amplifying only the A-B-C product.

This product is ligated to fragment D. The fragment A-B-C-D is in turn amplified
using the A top strand oligonucleotide and the D bottom strand oligonucleotide as primers, thus
amplifying only the desired A-B-C-D product.

In a variant of the procedure, the A-B fragment is amplified and the C-D fragment is synthesized as
described for A-B. Then A-B and C-D are ligated and PCR amplified using the A top strand
oligonucleotide and the D bottom strand oligonucleotide as primers, thus amplifying the desired
A-B-C-D sequence.
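
The selection logic of the ligate-then-amplify procedure can be sketched in a few lines of Python. This is a simplified illustration and not part of the described laboratory method: the function names (flip, ligation_products, pcr_select, build, show) are hypothetical, ligation is idealized as joining any two pieces of the mixture in any orientation, and longer concatemers are ignored. Under those assumptions, only the molecule bounded by the first fragment's top strand primer and the newest fragment's bottom strand primer survives each round.

```python
from itertools import product

def flip(molecule):
    """Read the same duplex from its other strand: reverse order, invert orientations."""
    return tuple((name, "-" if strand == "+" else "+") for name, strand in reversed(molecule))

def ligation_products(x, y):
    """Idealized ligation: any two pieces of the mixture joined in any orientation."""
    pieces = [x, flip(x), y, flip(y)]
    return {left + right for left, right in product(pieces, repeat=2)}

def pcr_select(mixture, top, bottom):
    """Keep molecules that start with `top` and end with `bottom`, both in forward orientation."""
    selected = set()
    for molecule in mixture:
        for reading in (molecule, flip(molecule)):
            if reading[0] == (top, "+") and reading[-1] == (bottom, "+"):
                selected.add(reading)
    return selected

def build(names):
    """Add one fragment per round, re-amplifying with the outermost primers each time."""
    current = ((names[0], "+"),)
    for name in names[1:]:
        mixture = ligation_products(current, ((name, "+"),))
        (current,) = pcr_select(mixture, names[0], name)  # exactly one product is expected
    return current

def show(molecule):
    """Render a molecule as a dash-separated string, marking reversed fragments."""
    return "-".join(name + ("" if strand == "+" else "(rev)") for name, strand in molecule)

if __name__ == "__main__":
    print(show(build(["A", "B", "C", "D"])))
```

The same pcr_select call, applied to a mixture of pre-built A-B and C-D fragments with the A top strand and D bottom strand primers, models the variant of the procedure.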

If it is desired to incorporate additional sequences containing for instance, amino acids 
as linkers (as described herein) between the amino acid segments, these molecules (e.g. amino acids) 
can be encoded by nucleotides contained in the primers utilized to amplify the desired fragment(s). 

Thus, the PCR-generated fragments are ligated in the desired order and portions are
PCR amplified using the upper and lower strand primers as illustrated. The method avoids arrangements
of nucleotide segments in an undesired sequence or orientation. The same methodology is followed for
longer fragments, and variations can be readily provided.

The polyvalent hybrid M proteins of the invention are tested by immunization of 
rabbits, a classic test animal. Assay for M protein antibodies is performed in accordance with known 
methods as described in the appended bibliography. So are assays for heart cross-reactive antibodies 
which are described in Beachey and Dale, (1982). 

The invention also includes the construction of hybrid constructs containing repeating 
amino-terminal M protein sub-units using PCR. A non-limiting illustration is described hereinafter.
The method is readily applied to any number of amino acids of any particular selected serotype to 
yield a hybrid gene containing its selected repeated amino acid fragment. 

A typical trivalent gene, M24-M5-M6, was constructed in a general manner as follows.
The M24-M5-M6 hybrid gene was constructed using a native fragment of the emm24
gene that was amplified by the polymerase chain reaction (PCR), purified and ligated into pBR322.
Oligonucleotide pairs copying the first 11 and 12 codons of the structural emm5 and emm6 genes,
respectively, were synthesized to contain appropriate restriction sites to facilitate ligation to the
emm24 gene. The entire trivalent construct was excised from pBR322 and ligated into pKK223-3
to allow high-level expression of the recombinant hybrid protein.

The emm24 gene was amplified by PCR using synthetic oligonucleotide primers that
specified amplification of the portion of the gene that encodes the pep M24 region of the protein,
which is approximately the amino-terminal half of the molecule. The primers were synthesized on
an automated DNA synthesizer (ABI, Model 381A) and had the following structures:
M24 Top Strand PCR Primer

     -EcoRI-  START
5' GG GAA TTC ATG GTC GCG ACT AGG TCT CAG 3'

M24 Bottom Strand PCR Primer

3' CGT CTC TTT CGA CTA GAA CTT CCT AGG CTC 5'
                              -BamHI-

The top strand primer was extended on the 5' end by an ATG, since the region of
emm24 to be amplified excluded the signal sequence and native start codon. An EcoRI restriction
enzyme site was also added and a GGG tail was incorporated to ensure that the enzyme would
recognize the cleavage site in the purified, double-stranded DNA product. The bottom strand primer
was extended on the 5' end by a BamHI restriction site and a CTC tail.
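
The layout of the two M24 primers (tail, restriction site, annealing region) can be checked with a short Python sketch. It is illustrative only: the helper name describe is hypothetical, the bottom strand primer is taken as printed in the 3' to 5' direction, and the triplet that appears as "AGO" in the scanned text is assumed to read "AGG".

```python
ECORI = "GAATTC"   # EcoRI recognition sequence
BAMHI = "GGATCC"   # BamHI recognition sequence

# Top strand primer as printed, 5' -> 3'.
top_5to3 = "GGGAATTCATGGTCGCGACTAGGTCTCAG"
# Bottom strand primer as printed, 3' -> 5' ("AGO" in the scan read as "AGG": an assumption).
bottom_3to5 = "CGTCTCTTTCGACTAGAACTTCCTAGGCTC"
# Reversing the printed string gives the conventional 5' -> 3' reading.
bottom_5to3 = bottom_3to5[::-1]

def describe(primer_5to3, site):
    """Split a primer into the 5' tail, the restriction site position and the annealing region."""
    index = primer_5to3.find(site)
    if index < 0:
        raise ValueError("restriction site not found in primer")
    return {
        "tail": primer_5to3[:index],
        "site_at": index,
        "annealing": primer_5to3[index + len(site):],
    }

if __name__ == "__main__":
    print(describe(top_5to3, ECORI))     # annealing region begins with the added ATG
    print(describe(bottom_5to3, BAMHI))  # tail is the CTC described in the text
```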

PCR amplification was performed using chromosomal DNA extracted from type 24
streptococci, which was used as the template. The reaction mixtures consisted of template DNA,
primer pairs, dNTPs and Taq DNA polymerase in PCR buffer. Amplification was performed in a
Perkin-Elmer Cetus automatic thermal cycler with denaturation at 95°C for 1 min, primer annealing
at 55°C for 1 min and primer extension at 72°C for 3 min for a total of 39 cycles. The PCR product
was electrophoresed in a 1% low melting-point agarose gel, the band of the predicted size was excised
and purified by adsorption to and elution from "glassmilk" (Geneclean, Bio 101). The purified
product was end-repaired using Klenow fragment and cut sequentially with EcoRI and BamHI
restriction enzymes in the appropriate buffers.

The purified emm24 PCR product was ligated into pBR322 that had been cut with
EcoRI and BamHI. The plasmid was used to transform E. coli strain MC1061 by standard methods.
Transformants were screened for the presence of plasmids containing inserts of the appropriate size
on agarose gels. One such plasmid (pCDM24) was then selected for purification and ligation of the
emm5 and emm6 synthetic oligonucleotide pairs, which were synthesized as described above according
to the following sequences:
emm5 synthetic oligonucleotide pair

1/2 BamHI - 1/2 SalI

5' GA TCC GCC GTG ACT AGG GGT ACA ATA AAT GAC CCG CAA G 3'

3' G CGG CAC TGA TCC CCA TGT TAT TTA CTG GGC GTT CAG CT 5'
emm6 synthetic oligonucleotide pair

1/2 SalI - PstI - 1/2 XmaIII

5' TC GAC AGA GTG TTT CCT AGG GGG ACG GTA GAA AAC CCG GAC CTG CAG 3'

3' G TCT CAC AAA GGA TCC CCC TGC CAT CTT TTG GGC CTG GAC GTC GCC GG 5' 
The emm5 oligonucleotides were mixed in equimolar ratios, heated at 65°C for 2 min
and allowed to reanneal at ambient temperature. pCDM24 was cut with BamHI and SalI and
purified on an agarose gel as described above. The emm5 oligonucleotide pair was then ligated to the
purified, cut plasmid and used to transform E. coli strain MC1061. Plasmids containing appropriately
sized inserts were identified on agarose gels and one (pCDM24.M5) was selected for ligation of the
emm6 oligonucleotide pair, which was accomplished in a similar manner. The resulting pCDM24.M5-M6
was then purified and cut with EcoRI and PstI, the site for the latter having been synthesized into
the emm6 oligonucleotide pair. The excised emm24.5.6 hybrid gene was then ligated into pKK223-3, a
high level expression vector that contains the tac promoter and a ribosome binding site adjacent to
the EcoRI cloning site.
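
Whether the two emm5 oligonucleotides anneal into a duplex with the expected cohesive ends can be checked with the following Python sketch. It is an illustration under stated assumptions: the helper name annealed_overhangs is hypothetical, and the triplets printed as "AGO GOT" in the scanned top strand are read as "AGG GGT", which is what the printed bottom strand (TCC CCA) pairs with.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# emm5 oligonucleotide pair; the bottom strand is given 3' -> 5' as printed.
emm5_top_5to3 = "GATCCGCCGTGACTAGGGGTACAATAAATGACCCGCAAG"
emm5_bottom_3to5 = "GCGGCACTGATCCCCATGTTATTTACTGGGCGTTCAGCT"

def annealed_overhangs(top_5to3, bottom_3to5, offset):
    """Pair top[offset:] with the start of the bottom strand and return both 5' overhangs.

    `offset` is the length of the unpaired 5' extension on the top strand.
    """
    paired_top = top_5to3[offset:]
    paired_bottom = bottom_3to5[:len(paired_top)]
    if paired_top.translate(COMPLEMENT) != paired_bottom:
        raise ValueError("strands are not complementary in the paired region")
    left = top_5to3[:offset]                       # 5' overhang of the top strand
    right = bottom_3to5[len(paired_top):][::-1]    # 5' overhang of the bottom strand, read 5' -> 3'
    return left, right

if __name__ == "__main__":
    # GATC matches a BamHI-cut end and TCGA a SalI-cut end.
    print(annealed_overhangs(emm5_top_5to3, emm5_bottom_3to5, offset=4))
```

The two overhangs correspond to the half sites that let the annealed pair ligate into the plasmid cut with BamHI and SalI.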

The construction, cloning and expression of M24-M5-M6-M19 tetravalent hybrid M 
protein was carried out as follows. 

The M24-M5-M6-M19 tetravalent M protein was constructed using fragments of the
5' regions of emm genes that were amplified by PCR, purified, ligated in tandem and expressed in
pKK223-3. The overall goal was to amplify the regions of the respective emm genes that encode
protective and not tissue-cross-reactive epitopes and link them into one protein molecule. The
recombinant hybrid protein contained 113 amino-terminal amino acids of M24, 58 amino acids of M5,
35 from M6 and 35 from M19. Each segment was linked by 2 amino acids specified by the respective
restriction enzyme sites that were synthesized into the oligonucleotide primers used to specify the
PCR products.

The primers for each emm gene were synthesized as described above according to the
following sequences:

M24 Top Strand
-EcoRI- Start
5' GG GAA TTC ATG GTC GCG ACT AGG TCT CAG 3'

M24 Bottom Strand
-BamHI-
5' GG GGA TCC TTC AAG ATC AGC TTT CTC TGC 3'

M5 Top Strand
-BamHI-
5' GGG GGG GGA TCC GCC GTG ACT AGG GGT ACA 3'

M5 Bottom Strand
-SalI-
5' GGG GGG GTC GAC CTC ACT TTT TAA CCC TTC 3'

M6 Top Strand
-SalI-
5' GGG GGG GTC GAC AGA GTG TTT CCT AGG GGG 3'

M6 Bottom Strand
-NcoI-
5' GGG GGG CCA TGG TAA GTT GTC AAT AAT AGC 3'

M19 Top Strand
-NcoI-
5' GGG GGG CCA TGG AGA GTG CGT TAT ACT AGG 3'

M19 Bottom Strand
-PstI-
5' GGG GGG CTG CAG AGA TAA CTT CTC ATT CTG 3'

The M24, M5 and M6 oligonucleotide sequences were based on previously published
data. See Bibliography supplied herewith and incorporated herein by reference. The M19 sequence
was obtained in a similar manner from a plasmid containing the entire emm19 structural gene.
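
The primer design above can be tabulated and checked with a short Python sketch. It is a hedged illustration, not part of the described procedure: the names SITES, PRIMERS, CODONS, linker and junctions are hypothetical, the codon table holds only the six codons needed here, and the restriction site is assumed to be read in frame, as implied by the statement that each segment is linked by 2 amino acids specified by the restriction sites.

```python
# Recognition sequences of the restriction enzymes named in the primer list.
SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC", "SalI": "GTCGAC",
         "NcoI": "CCATGG", "PstI": "CTGCAG"}

# (serotype, strand) -> (site named in the text, primer 5' -> 3')
PRIMERS = {
    ("M24", "top"): ("EcoRI", "GGGAATTCATGGTCGCGACTAGGTCTCAG"),
    ("M24", "bottom"): ("BamHI", "GGGGATCCTTCAAGATCAGCTTTCTCTGC"),
    ("M5", "top"): ("BamHI", "GGGGGGGGATCCGCCGTGACTAGGGGTACA"),
    ("M5", "bottom"): ("SalI", "GGGGGGGTCGACCTCACTTTTTAACCCTTC"),
    ("M6", "top"): ("SalI", "GGGGGGGTCGACAGAGTGTTTCCTAGGGGG"),
    ("M6", "bottom"): ("NcoI", "GGGGGGCCATGGTAAGTTGTCAATAATAGC"),
    ("M19", "top"): ("NcoI", "GGGGGGCCATGGAGAGTGCGTTATACTAGG"),
    ("M19", "bottom"): ("PstI", "GGGGGGCTGCAGAGATAACTTCTCATTCTG"),
}

# Standard genetic code, restricted to the codons that occur in the junction sites.
CODONS = {"GGA": "Gly", "TCC": "Ser", "GTC": "Val", "GAC": "Asp", "CCA": "Pro", "TGG": "Trp"}

def linker(site_name):
    """Translate a 6-base restriction site, read in frame, into its two-residue linker."""
    site = SITES[site_name]
    return "-".join(CODONS[site[i:i + 3]] for i in (0, 3))

def junctions(order):
    """For consecutive segments, confirm the shared site and report the encoded linker."""
    result = []
    for left, right in zip(order, order[1:]):
        left_site = PRIMERS[(left, "bottom")][0]
        right_site = PRIMERS[(right, "top")][0]
        if left_site != right_site:
            raise ValueError(f"{left} and {right} do not share a restriction site")
        result.append((left, right, left_site, linker(left_site)))
    return result

if __name__ == "__main__":
    for row in junctions(["M24", "M5", "M6", "M19"]):
        print(row)
```

Under these assumptions the three junctions translate to Gly-Ser (BamHI), Val-Asp (SalI) and Pro-Trp (NcoI).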

The oligonucleotide primer pairs described above were used to amplify the regions of 
each emm gene using chromosomal DNA from the respective serotype of group A streptococci as the 
template in the PCR reaction, as described above. The PCR products were purified by excision from 
low melting point agarose. Because some of the bottom strand PCR primers annealed to regions of 
the emm genes that were repeated, the PCR products were of variable sizes. In each case, the smallest 
major band was excised and purified for ligation. 

Ligation of the purified PCR products was accomplished by first cutting the M24 and
M5 fragments with BamHI and then ligating the two cut fragments. The ligation mixture was then
subjected to amplification by PCR using as primers the M24 top strand oligonucleotide and the M5
bottom strand oligonucleotide in order to amplify only the M24-M5 ligation product. This hybrid
PCR product was then purified, cut with SalI and ligated to the M6 fragment that had also been
digested with SalI. The M24-M5-M6 hybrid was once again subjected to PCR amplification using
the M24 top strand primer and in this case the M6 bottom strand primer. The same sequence of
events was then followed to ligate the M19 component to make the completed tetravalent gene. The
purified product was cut with EcoRI and PstI and ligated into the respective sites of pKK223-3,
which was used to transform E. coli strain JM105. The recombinant expressing the hybrid M protein
was initially identified by screening colony blots on nitrocellulose with rabbit antisera against pep
M24.

The construction of repeating amino-terminal M protein sub-units using PCR was
carried out as follows.

Multimeric amino-terminal fragments of emm genes were constructed using PCR
amplification. For example, the first 12 amino acids of M19 were expressed as three repeats by the
following method:
PCR Primers:

M19-5' repeater - 5' [...] GAA GAT AAG AGA
GTG CGT TAT ACT AGG
M19-3' BS - 5' GGG GGG CCA TGG CTT ATC TTC TGG CGT ATG

M19-5' monomer - 5' GGG GGG GAA TTC AGA GTG CGT TAT ACT AGG

The M19-5' repeater encodes the first 12 amino acids of the NH2 terminus with the
codons for the first 6 amino acids added to the 3' end of the primer as a repeating sub-unit. M19-3'
BS copies the complementary strand of codons 7-12 with an NcoI enzyme site and a poly-G clamp.
The M19-5' monomer encodes the first 6 amino acids of the NH2 terminus with an EcoRI enzyme
site and a poly-G clamp at the 5' end.

Template DNA from type 19 streptococci was first amplified with the M19-5' repeater 
and M19-3' BS, which resulted in a ladder of PCR products ranging in size from 2 to ~6 sub-units.
One product of the appropriate size (to encode 3 sub-units) was purified from agarose gels and then 
subjected to PCR amplification using the M19-5' monomer and M19-3' BS primers. The single 
product was then ligated into the appropriate restriction sites. 

The sequence analysis of the trivalent and tetravalent hybrid emm genes was carried
out as follows.

The structures of the hybrid emm genes described above were confirmed by
sequencing the inserts in pKK223-3 by the dideoxynucleotide chain termination method of Sanger
et al.

The purification of recombinant hybrid M proteins was carried out as follows. The
trivalent and tetravalent hybrid M proteins were purified from extracts of E. coli JM105 grown
overnight in 1 l of L broth supplemented with 75 ug/ml ampicillin, 25 ug/ml streptomycin and IPTG
(1 mMol). The cells were pelleted at 7000 x g and resuspended in 50 ml carbonate buffer, pH 11.0,
containing 100 ug/ml lysozyme, 1 mMol EDTA and 100 ug/ml PMSF and incubated at 37°C for 30
min. The cells were centrifuged at 7000 x g and the supernatant was dialyzed against distilled water
and lyophilized. Purification was performed by loading 50 mg of extract containing either the
trivalent or the tetravalent M protein hybrid onto a preparative PAGE unit (Prep Cell, Model 491,
Bio-Rad, Inc.) using a 37 mm column and a 9 cm 11% polyacrylamide gel. Six ml fractions were
collected and assayed for the presence of recombinant proteins by Western blot analysis using pep
M24 antisera. Peaks containing activity were pooled and lyophilized.

Immunization of Rabbits 
Rabbits were immunized with 300 ug of the selected polyvalent (e.g. the trivalent or 
tetravalent) hybrid M proteins emulsified in complete Freund's adjuvant. Booster injections of the 
same dose in PBS were given at 4, 8 and 12 weeks. Blood was obtained prior to immunization and 
at two-week intervals thereafter. The other polyvalent hybrid M proteins are used in the same
manner. Likewise, a mixture (or "cocktail") of such hybrids is used in the same manner.

Assays for M Protein Antibodies
Total antibody activity against M protein was determined by ELISA using native pep 
M proteins, recombinant M proteins, synthetic peptides or purified polyvalent (e.g. trivalent or 
tetravalent) hybrid M proteins as solid phase antigens by methods previously described. Opsonic 
antibodies were assayed by in vitro opsonophagocytosis tests, as described. ELISA-inhibition and 
opsonization-inhibition experiments were performed using purified M proteins or synthetic peptides
as soluble inhibitors of M protein antibodies in ELISA or opsonization tests. Western blots of 
recombinant proteins were performed using antisera raised in rabbits against synthetic peptides or 
native M proteins. The other polyvalent hybrids were assayed following the same protocol. 
Assays for Heart-Cross-Reactive Antibodies
Antisera against recombinant multivalent M proteins were screened for the presence
of heart-cross-reactive antibodies by indirect immunofluorescence tests using frozen sections of
human myocardium as previously described in the literature.

Assays for M Protein Epitopes that Evoke Mucosal Antibodies
Broadly Protective Against Infection
Rabbit antisera were screened for the presence of broadly protective antibodies using
passive mouse protection assays (see Bronze, M.S., et al., Protective and Heart-Crossreactive Epitopes
Located within the N-Terminus of Type 19 Streptococcal M Protein, J. Exp. Med., 167, 1849-1859
(1988)). Antisera were first tested for the ability to react with the surface M protein of multiple
heterologous serotypes of group A streptococci by ELISA. Those that recognized M protein epitopes
in their native conformations were then used to passively protect mice against intranasal challenge
infections. Antibodies were absorbed to virulent streptococci and mice were challenged intranasally
with 10^7 CFU. Throat cultures were obtained on alternate days and deaths were counted over the
ensuing 14 days. Vaccine constructs that evoke protective antibodies in rabbits will be used to
immunize mice intranasally to test directly their protective immunogenicity. Actively immunized
mice will be similarly challenged with virulent streptococci.

An illustration of a tetravalent hybrid gene M24-M5-M6-M19 with different linkers
than shown in FIG. 4 is shown in FIG. 6. The sequence of the hybrid gene shows the structure
of emm24, emm5, emm6 and emm19. The tetravalent emm gene expresses a protein with
a calculated M.W. of 30.7 kDa and contains 113 amino-terminal amino acids of type 24 M protein,
58 amino acids of type 5 M protein, 35 amino acids of type 6 M protein, and 35 amino acids of type
19 M protein. The linker is a proline-rich linker, Pro-Gly-Asn-Pro-Ala-Val-Pro, the codons for
which are inserted into the BamHI, SalI and NcoI restriction enzyme sites, respectively, which were
synthesized into the original PCR primers. This linker in part includes the amino acids of the
restriction site codons at the desired position.

Other linkers may be used, such as for instance a sequence which includes the amino acids
Ile-Pro-Gly or Asp-Pro-Arg-Val-Pro-Ser-Ser.

The sequence of the amino acids in any particular linker used appears at this time not
to be critical. Theoretically, a linker could be constituted by one amino acid; if the effect of
promoting a functionally effective conformation of the encoded protein is desired, longer linkers may
be selected, such as those having 14 or more (e.g. 20) amino acids.

As described herein, linkers are not essential to the structure, so that it is not necessary
that any one amino acid sub-unit, or for that matter all amino acid sub-units, be fused to each other
by an amino acid linker. For illustration, M24 and M5 can be directly fused to each other. Further,
while the description herein refers to linkers constituted by amino acids encoded by hybrid genes,
some hybrid constructs may contain purified recombinant M proteins which may include one or more
molecules which are not an amino acid, such as succinimidyl-4-(N-maleimidomethyl)cyclohexane-1-carboxylate
(SMCC). The linkers can be of the same or of varying lengths between each amino acid
segment. These molecules may be introduced by chemical means as opposed to being expressed with
the hybrid protein.

The tetravalent hybrid gene illustrated in FIG. 6, when tested for antigenicity by 
reacting with polyclonal rabbit antisera raised against each one of the components of the hybrid 
protein, will indicate that the epitopes are present in a conformation which resembles that of the 
native protein. 

When the immunogenicity of the tetravalent protein and the antibody level are
determined, the immune sera will also contain opsonic antibodies against all four serotypes of group
A streptococci.

It is contemplated in accordance with the invention that the amino terminal amino acid
fragment be constituted to contain one or more amino-terminal portions of other potentially
rheumatogenic serotypes, known or to be discovered, that have such potential rheumatogenic effect.
Also it is contemplated that the constructs of the invention be constituted of the amino-terminal
region of serotypes which are not known to have such rheumatogenic effect.

Where such a sequence has not yet been published, one skilled in the art can determine such
structures by methods readily available.

Thus the invention contemplates such multivalent proteins encoded by appropriate hybrid
genes expressed in an appropriate organism.

The effect of the different linkers on the immunogenicity of the hybrid molecule may
justify further investigations. It is not excluded that, depending on the nature of the linker and of
the type and size of the amino acid fractions, a hybrid protein of ideal or close to ideal high
immunogenicity may be identified. Such a hybrid is within the scope of the invention.

What has been described hereinabove also applies with respect to the carboxy-terminal
fraction or the C-repeats thereof when such fraction(s) or repeats are used as described herein.

It should be kept in mind that not all fragments of
the hybrid need be immunogenic when considered individually (and without a carrier) provided that,
when part of the final hybrid, they contribute to the desired immunogenicity or at least do not
detract therefrom.

The sequence of amplified (M-like) 2, 3, 18 and 19 M genes is discussed in Podbielski
et al., Application of the Polymerase Chain Reaction to Study the M Protein(-like) Gene Family in
Beta Hemolytic Streptococci, Med. Microbiol. Immunol., 180, 213 (1991). Genes of the M12 type
(emm12) of a nucleotide sequence of 1693 base pairs are described in Robbins et al., Streptococcus
pyogenes Type 12 Protein Gene Regulation by Upstream Sequences, Journal of Bacteriology, 5633-
5640 (Dec. 1987). The NH2-terminal sequence of type 1 streptococcal M protein is discussed in Kraus
et al., Sequence and Type-Specific Immunogenicity of the Amino-Terminal Region of Type 1
Streptococcal M Protein, The Journal of Immunology, 139, 3084-3090 (Nov. 1987), incorporated by
reference.

The NH2-terminal fragment is constituted of fragments of 28 kDa, 25 kDa and 23.5
kDa. The article discusses similarities and differences with other NH2-terminal M protein sequences.
Opsonic antibodies are developed against type 1 streptococci. It is noteworthy that the NH2-terminal
region of type 1 M protein also retains epitopes that evoke protective immune responses.

It is therefore within the scope of the invention for the hybrid construct to contain
NH2-terminal regions which also raise protective mucosal responses (not only opsonic responses) in
those instances where the NH2-terminal region does raise both types. Thus, the carboxy terminal
fragment is not always necessary for a hybrid to raise mucosal or cellular responses.

In general, the NH2-terminal residues of the different M proteins which show less
ordered structure and are more variable from one type to another comprise about 10 to 20 residues.

Another embodiment of the invention is illustrated in Figure 7, which shows a M24-
M24-M24-M5-M5-M5-M6-M6-M6-M19-M19-M19 multivalent hybrid of 561 nucleotides and a
calculated M.W. of 21.6 kDa constituted of 187 amino acids with restriction sites between the
different fragments as shown, respectively BamHI, SalI and NcoI. It will be observed that the
repeated amino acid fractions of the respective types M24, M5, M6 and M19 (underscored) are of the
same length. They need not be so. The smaller size of the repeat fragments of the construct is
intended to enhance the immunogenicity of the entire molecule, as opposed to longer fragments as
described elsewhere herein, and to evoke antibodies against the distal (and most protective and least
tissue-cross-reactive) epitopes. Each fifteen amino acid sub-unit is repeated three times. Further,
such smaller amino acid fragments can be more readily synthesized by an amino acid synthesizer or by
a novel modification of the classic PCR method as described herein. In this embodiment of the
invention, it will be observed that there are no linkers.
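
The figures quoted for the repeat construct can be cross-checked with a few lines of Python. This is only bookkeeping on the numbers stated above; the function name repeat_layout is hypothetical. The interpretation in the final comment is an assumption, since the text does not say which residues lie outside the repeated sub-units.

```python
def repeat_layout(serotypes, repeats):
    """Order of sub-units when each serotype fragment is repeated in tandem."""
    return [name for name in serotypes for _ in range(repeats)]

layout = repeat_layout(["M24", "M5", "M6", "M19"], repeats=3)
layout_string = "-".join(layout)

SUBUNIT_LENGTH = 15          # amino acids per repeated sub-unit (from the text)
TOTAL_NUCLEOTIDES = 561      # from the text
TOTAL_AMINO_ACIDS = 187      # from the text

residues_in_repeats = len(layout) * SUBUNIT_LENGTH        # 12 sub-units x 15 residues
codons = TOTAL_NUCLEOTIDES // 3                           # coding capacity of the gene
outside_repeats = TOTAL_AMINO_ACIDS - residues_in_repeats

# Assumption, not stated in the text: the residues outside the repeats may be
# codons contributed by the three junction restriction sites plus a start codon.

if __name__ == "__main__":
    print(layout_string)
    print(residues_in_repeats, codons, outside_repeats)
```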

Contemplated within the invention are similar hybrid structures in which the
individual repeated segments are longer or shorter than the 15 amino acids shown. Since the
individual M fragments constituting the hybrid are, when considered alone, of different
immunogenicity, it appears worthwhile to consider increasing or decreasing the length of one or more
of such fragments to further increase the overall immunogenicity of the molecule. Linkers can also be
included.

Further, any one of the fragments here illustrated in FIG. 1 hereinabove can be
replaced by another serotype, such as serotype 1, 3 or 18. In this manner, tetravalent hybrid genes 
can be constructed and the corresponding hybrid protein expressed. Likewise longer, such as penta-, 
hexa-, octa-, nona- or decavalent hybrid genes and the corresponding expressed proteins can be 
obtained. 

In that connection, it is noteworthy that, as opposed to increasing the number of amino
acid fragments constituting a particular multivalent vaccine, such as to an octa-, nona- or
decavalent vaccine, it would be more advantageous to provide a mixture of smaller constructs, the
mixture being constituted by at least 2 of such constructs. In this manner, a mixture or
"cocktail" of multivalent vaccines can be provided in which one construct has an optimum maximum
length (or size), such as a tetra- or pentavalent structure, and another one is of approximately
similar length and constituted of other serotypes. For instance, a multivalent hybrid vaccine
constituted of M24-M5-M6-M19 can be provided in admixture with one containing M1-M3-M18 and in
further admixture with a multivalent hybrid vaccine constituted of M1-M24-M5 and yet a fourth one
containing any one of those above mentioned including M18. It is to be observed that the order
(sequence) in which these M protein fragments have been described herein is not necessarily the
sequence to which the invention is limited, as has been described repeatedly herein.

Another interesting illustration of a tetravalent hybrid gene of the invention is
shown in Figure 8, which shows the sequence of tetravalent M24-M5-M6-M19 with the carboxy
terminal half of M5 joined at the PstI restriction site. The hybrid construct has 305 amino acids
expressed by the 915 nucleotides. The restriction sites are shown.

It will be observed that the tetravalent amino acid fragments are each 15 amino
acids long and joined directly to each other without the intermediary of amino acid linkers. This is
an illustration of the concept of the invention combining the multivalent vaccine with that of a
carboxy terminal of one of the M protein serotypes.

Instead of using the carboxy terminal of M5, any other M-COOH may be used, such
as that of M24, M19 or M6. Care will be taken, of course, that the carboxy terminal not be one that
would generate undesirable antibodies such as tissue-cross-reactive antibodies. In the illustrated
construct, not only are opsonic antibodies generated against the four M protein fractions, but
also protective mucosal antibodies against the carboxy terminal portion of the molecule. As explained
herein, such a structure has distinct advantages in that it can serve as a vaccine in controlling nasal
or other infections often preceding rheumatic fever.

A vaccine constituted of the construct illustrated herein, or a similar one, is therefore
an effective therapeutic and prophylactic agent which, interestingly enough, may be administered
nasally, as by spray.

Instead of using the entire carboxyl terminal of any one of the M serotypes, it may be
advantageous to use only one or more amino acids of the C-repeats of the carboxyl terminal of a
particular serotype. It is noteworthy that the carboxy terminal or the amino acids constituting one or
more C-repeats used in the construct need not be of the same serotype(s) as that which constitutes
the amino terminal portion of the construct. Thus, such a vaccine will provide cellular immune
responses (which normally are less type-specific or more cross-reactive than the opsonic response)
and concurrently provide type-specific immunity.

It should be noted in conjunction with the invention as has been described herein that
not all M protein epitopes are sero-specific in the amino terminal portion of the molecule. Some
epitopes of particular serotypes, such as M5, do cross-react to some extent with streptococci of a type
other than M5, such as M6 or M19. To some extent this also occurs with other M serotypes.
Accordingly, it is within the scope of this invention that when sero-specificity is referred to, this
does not exclude some cross-reactivity between certain shared structures.

However, such shared epitopes are often also the most likely to cross-react with heart
tissue and hence present potentially serious risks, and are not opsonic or do not evoke antibodies
with the desirable high level of opsonic activity.

In another embodiment of the invention, the amino terminal amino
acid sequences of the various fragments have been co-expressed in a different order. A comparison
between, on the one hand, the structures of FIGS. 4 and 6 and, on the other hand, FIG. 9 will show
that the DNA recombinant nucleotide sequence and the deduced amino acid sequence of the tetravalent
hybrid M19-M6-M5-M24 are in the reverse order from that in the other above-mentioned constructs. The
construct of Figure 9 illustrates a multivalent vaccine having 247 amino acids expressed from a
nucleotide sequence of 741 DNA nucleotides. The respective M19, M6, M5 and M24 fragments are
of the following respective amino acid lengths: 35, 35, 58 and 113.
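
The stated lengths can be reconciled with a short Python sketch. It is an illustration only: the function name check_construct is hypothetical, and the default of two linking residues per junction is an assumption carried over from the description of the M24-M5-M6-M19 construct, in which each segment is linked by 2 amino acids specified by the restriction sites.

```python
def check_construct(segments, total_aa, total_nt, junction_aa=2):
    """Compare stated totals with the sum of segment lengths plus junction residues.

    segments: mapping of serotype name to fragment length in amino acids, in order.
    junction_aa: residues contributed by each junction (assumed to be 2 here).
    """
    segment_aa = sum(segments.values())
    linker_aa = (len(segments) - 1) * junction_aa
    return {
        "segment_aa": segment_aa,
        "linker_aa": linker_aa,
        "accounted_aa": segment_aa + linker_aa,
        "codons": total_nt // 3,
        "in_frame": total_nt % 3 == 0,
        "consistent": segment_aa + linker_aa == total_aa == total_nt // 3,
    }

fig9 = check_construct({"M19": 35, "M6": 35, "M5": 58, "M24": 113},
                       total_aa=247, total_nt=741)

if __name__ == "__main__":
    print(fig9)
```

With these inputs the four fragments account for 241 residues and the three junctions for 6, which matches the 247 amino acids and 741 nucleotides stated for the construct.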

In studies related to the invention, it had been found that the immunogenicity of
certain sub-units or fragments is greater than that of others. For instance, that of M24 is greater
than that of M5, which in turn is greater than that of M6, which in turn is greater than that of M19.
It was also observed that the immunogenicity against the amino terminal M24 and M5 sub-units in a
construct containing M24-M5-M6-M19 was greater than against the M6 and M19 components. However, it
was of interest in conjunction with this invention to determine whether such postulates would hold
true in the constructs of the invention, in particular in a total reversal of the order of the amino
acid sub-units as is shown in Figure 9 or, for that matter, in the rearrangement of some of these
sub-units in any order desired, such as M19 followed by M6, then by M24 and then by M5.

The same remarks apply with respect to the other M protein serotypes such as 3, 12 and 18.
Indeed it is not seen at this time why organizing the sequence of amino acids by increasing (or
decreasing) immunogenicity should apply to the constructs of the invention. In that sense, the
vaccines of the invention may present another interesting departure from the conventional.
The tetravalent protein reacted with polyclonal rabbit antisera raised against each
component of the hybrid protein as described herein, thus indicating that the epitopes were present
in a conformation which mimicked the native protein. Rabbits immunized with a purified tetravalent
M protein are expected to develop significant antibody levels against the tetravalent vaccine and
against all four serotypes of the purified native M proteins. By changing the order of the amino acid
fragments in the tetravalent hybrid, different levels of opsonization can be observed.

As described herein, when linkers are used, linkers which are of particular interest 
are constituted to be overall hydrophobic, i.e. they are constituted by a multiplicity of amino acids 
with nonpolar groups. Included in such amino acids are those with aliphatic groups, such as alanine, 
leucine, isoleucine, valine and proline; those with aromatic rings, like phenylalanine and tryptophan; 
and methionine. It is in accordance with the invention that others of the 22 amino acids may be 
considered, with the hydrophobic amino acids or not, to form the appropriate linkages in those hybrid 
constructs where linkages are desired. 
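
As an illustration of the "overall hydrophobic" criterion, the sketch below computes the fraction of linker residues that belong to the nonpolar amino acids enumerated above. It is a rough screening aid under stated assumptions, not an assay of the invention: the function name `nonpolar_fraction` is hypothetical, and the example linker is the seven-residue sequence read from the linker-variant construct of Figure 6.

```python
# Nonpolar residues as enumerated in the description (three-letter codes).
NONPOLAR = {"Ala", "Leu", "Ile", "Val", "Pro", "Phe", "Trp", "Met"}


def nonpolar_fraction(linker: str) -> float:
    """Fraction of residues in a space-separated linker that are in NONPOLAR."""
    residues = linker.split()
    if not residues:
        raise ValueError("empty linker")
    return sum(residue in NONPOLAR for residue in residues) / len(residues)


# Linker as read from Figure 6 (an assumption about where the linker starts and ends).
FIG6_LINKER = "Pro Gly Asn Pro Ala Val Pro"
print(round(nonpolar_fraction(FIG6_LINKER), 2))  # 0.71, i.e. five of seven residues
```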

A divalent hybrid gene of emm24 and emm5, 522 nucleotides long, and the amino acid 
sequence of the 174 amino acid hybrid protein which it expresses are illustrated in FIG. 10. 
The BamHI restriction site is shown. 

Opsonic antibodies are elicited against the respective native M proteins. Immunogenicity 
is tested in rabbits. 

An illustration of a tetravalent gene of the invention in FIG. 11 shows the sequence 
M19-M6-M5-M24, the reverse of the order shown in FIG. 8, with two and a half C-repeats of the 
M5 carboxy terminus. The 1029 nucleotide long sequence shows the restriction sites between the 
respective fragments and the fused C-repeats, a total of 280 nucleotides long, of the carboxy 
terminal amino acid region of M5. 

This construct is noteworthy in that it will elicit antibodies not only against 
M6-M19-M24-M5, but also against the C-repeats. Each complete C-repeat is 35 amino acids long, the 

What is of particular interest in this construct of the invention is that in addition to 
the sub-units from rheumatogenic end terminal fractions, a non-rheumatogenic serotype of 
streptococci was encoded and expressed, namely, that of M12. 

This is an important aspect of the invention, which is not limited to the particular 
non-rheumatogenic M12 serotype of streptococci illustrated. Indeed, instead of a fraction of M12, an 
appropriate fraction, such as of 10, 12 or more amino acids, of any of the non-rheumatogenic serotypes 
can be used, such as the following: 2, 10, 8, 9, 11, 22, 33 and others. 

While a certain number of the rheumatogenic type M proteins have not yet been 
sequenced or their sequence not yet disclosed, such sequencing is readily performed by methods 
known in the art and hence appropriate fractions free of epitopes that cross-react with human tissue 
can be made to constitute the multivalent hybrid of the invention. It is also contemplated that more than 
one fraction of the rheumatogenic serotype of streptococci be contained in the multivalent vaccine. 
When the antigenicity of this octavalent hybrid protein is tested, it reacts with polyclonal rabbit 
antisera raised against each component of the hybrid protein. Its immunogenicity and lack of 
cross-reactivity with human tissue, especially myocardium, can be tested in accordance with the assays 
described herein. Thus, the immune sera will contain opsonic antibodies against all seven 
serotypes of group A streptococci. The immunogenicity of the tetravalent protein and the antibody 
level will be determined. The immune sera, which will also contain opsonic antibodies against all four 
serotypes of group A streptococci, will be tested for bactericidal activity in vitro and in in vivo passive 
mouse protection tests as described. The multivalent protein is assayed in accordance with the assay 
described herein for the presence of broadly protective antibodies using passive mouse protection 
assays. In this case actively immunized mice will be similarly challenged with the virulent 
streptococci. 

Thus, in accordance with the invention, a multivalent hybrid M protein is provided 
which confers broad immunity against several serotypes and also elicits protective mucosal immunity. 

It is also within the contemplation of the invention to have a multivalent vaccine constituted 
of two different serotypes followed by a non-rheumatogenic fraction as was illustrated hereinabove, 
for example in conjunction with FIG. 13. Likewise, it should be noted that it is conceivable that the 
hybrid gene and hence the expressed hybrid protein have the non-rheumatogenic serotype of the 
streptococcus as the first fraction upstream of the rheumatogenic amino acid fractions of the amino 
terminal portions of the respective serotypes of streptococci. Further, it is within the contemplation 
of the invention to construct hybrid genes and the expressed hybrid protein constituted of one or more 
rheumatogenic serotypes, such as M24-M5-M6-M19-M1-M3-M18 or others, followed or preceded by 
a non-rheumatogenic fraction of a serotype of streptococci, such as M12 or others. 

Of particular interest in conjunction with the invention are vaccines which include 
amino acid sub-units of any or all of the 1 through 80 different serotypes known or to be discovered 
of which approximately 15 are known to be causative of, or at least to contribute to, the development of 
acute rheumatic fever following strep throat. 

It is important to note that the invention is not limited to a particular amino acid 
sequence wherever herein amino acid sequences are referred to or described. In any particular amino 
acid sequence or fragment referred to herein, any one or more of the amino acids can be removed or 
substituted, i.e. replaced by some other amino acid(s), as long as the desired epitopes are not adversely 
affected by such changes in the structure of the amino acid sequence. Indeed, this is quite commonly 
found, in that the amino acid sequences of certain types of M proteins, such as type M5, which originate 
from different strains of M5 (and also come from different origins and/or at different times) may have 
different amino acid substitutions, i.e. constitution. This has been shown for several such M 
proteins. Reference to that effect should be made to Miller et al., J. Biol. Chem., 263: 5668 (1988), 
"Antigenic Variation Among Group A Streptococcal M Proteins: Nucleotide Sequence of the Serotype 
5 M Protein Gene and its Relationship with Genes Encoding Types 1, 6 and 24 Proteins", and also see 
Dale et al., J. Exp. Med., 163:1191-1202 (1986), "Localization of Protective Epitopes of the Amino 
Terminus of Type 5 Streptococcal M Protein". 

As has been described, the N-terminal segment which is free of tissue cross-reactive 
epitopes can range from 10 to 115 amino acids and on average be about 35 amino acids, depending 
on the particular M protein type. 

Accordingly, any single fragment or sub-unit constituting the hybrid gene and hence 
the expressed hybrid protein can be constructed to have a number of amino acid substitutions so as 
to contain such amino acid substitutions from one strain and, for instance, two or more substitutions 
of another strain of the same serotype. Thus, functionally, the antibodies generated would react 
optimally with all the strains of the particular type 5 M protein. 

It is therefore an important concept in this invention that when reference is made to 
a particular serotype (i.e. of any one of the known or to be discovered serotypes, e.g., 1-82) reference 
is not intended to one single type or strain such as that of M5-M6-M19-M24, but to the various 
strains of such serotypes which may, as described, have amino acid variants. Thus, the 
fundamental concept of the invention is to provide a multivalent vaccine not only against different 
serotypes, but also against different strains within a particular serotype. 

Likewise, the nucleotide sequences can be so modified as to code for the desired 
immunobiologically functionally equivalent amino acid sequence. Similarly, it is within the scope 
of the invention that, due to the degeneracy of the genetic code, DNA sequences be constructed or used 
that encode and express the desired amino acid fragments in a selected organism transformed (or 
transfected) with the selected self-replicating vehicle. 
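
The degeneracy referred to above can be illustrated with the standard genetic code. The sketch below is an illustration only; the names `CODON_TABLE`, `translate` and `synonymous_codons` are hypothetical, and the two example sequences are short stretches of the M24 coding region that differ only in the codon used for asparagine (AAT versus AAC).

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (spaces allowed) into one-letter amino acids."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def synonymous_codons(amino_acid: str) -> list:
    """All codons of the standard code that encode the given one-letter amino acid."""
    return [codon for codon, aa in CODON_TABLE.items() if aa == amino_acid]


seq_a = "GGC GCA ATG AAT TTT TCA"
seq_b = "GGC GCA ATG AAC TTT TCA"
print(translate(seq_a), translate(seq_b))  # GAMNFS GAMNFS
print(synonymous_codons("N"))              # ['AAT', 'AAC']
```

Two DNA sequences that differ at the nucleotide level can therefore express the same amino acid fragment, which is the sense in which alternative coding sequences fall within the scope described.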

As has been described herein, the invention is not limited to a multivalent hybrid gene 
or expressed multivalent vaccine of any specific maximum number of serotypes. Since, 
however, there appear to be practical limits, it has been suggested that a cocktail or mixture of 
appropriately sized multivalent hybrids be constructed. 

Another aspect of the present invention is the hybrid or fusion genes 
which have been constructed and which encode the antigens of the present invention. The fusion genes 
code for the antigens of the invention, constituted as described above, of amino acid fragments linked 
to the selected carrier. The genes are inserted into suitable self-replicating vehicles, like plasmids. 
The plasmids containing the genes are then used to transform nonvirulent microorganisms. The 
transformed microorganisms express the hybrid or fusion protein antigens which are capable of 
eliciting opsonic and/or protective antibodies against serotypes of Group A streptococcus in 
immunized mammals, without eliciting cross-reactive antibodies to mammalian heart tissue antigens. 

The compositions of the invention can be administered by any suitable route, including 
orally and nasally. They can be dispersed in an appropriate propellant, as for nasal administration. 

The therapeutic compositions of the present invention may also be administered 
parenterally. Mammals, in particular humans, immunized parenterally with a sufficient amount of 
the therapeutic composition of the present invention develop opsonic and/or protective antibodies 
directed to the epitopes of the hybrid streptococcal M protein antigen. Non-limiting examples of 
such parenteral routes of administration are intracutaneous and intramuscular. 

For intracutaneous injection, 100-300 µg of hybrid antigen emulsified in complete or 
incomplete Freund's adjuvant was administered to a mammal. A booster injection of about the same 
dose in saline was administered about one month later. Blood was obtained prior to the first injection 
and at two-week intervals thereafter for eight weeks. 

A topical method of administration is also provided, namely intranasal. For intranasal 
administration, a mammal received about 50 µg to about 10 mg of purified antigen in an appropriate 
diluent for administration. Such a method may be particularly well suited when the vaccine is 
constructed to evoke secretory or mucosal immunity, since nasopharyngeal infection is a common 
infection in humans. 

In accordance with the invention, the therapeutic composition may be administered 
singly in series or advantageously in a mixture or cocktail of multiple compositions to elicit broad 
spectrum immunity versus Group A streptococci. 

bacterial hosts include S. mutans and S. sanguis. 

Particularly preferred are S. sanguis and S. mutans, which is non-cariogenic. 

Additional appropriate microorganisms which may be attenuated and transformed in 
accordance with the invention are known. Reference may be made to Davis, et al., Microbiology 
(Harper & Row, Second edition, 1973). 

Generally any enteric bacterium may serve as the host bacterium. It is preferable that 

to cytoplasmic compartment. 

In accordance with the present invention, general molecular biology methods are used. 
Any suitable plasmid or bacteriophage cloning vector may be selected. The vector should have an 
origin of replication that is functional in the intended microbial host cells, and a selectable marker 
(such as an antibiotic resistance gene) to aid in identification of host cells that have been transformed 
with the vector. It should be able to accept inserted DNA fragments and still replicate normally. 
Preferably, the vector comprises one or more unique restriction endonuclease recognition sites at 
which hybrid DNA fragments can be inserted without destroying the vector's ability to replicate. 
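
Whether a recognition site is unique in a given sequence can be checked by simple counting. The sketch below is a minimal illustration, assuming a linear, single-strand search (the three sites listed are palindromic, so the complementary strand gives the same count; wrap-around in a circular vector is not handled). The function names and the toy sequence are hypothetical; the recognition sequences are those of BamHI, SalI and NcoI, the enzymes marked in the figures.

```python
# Recognition sequences of the restriction enzymes marked in the figures.
SITES = {"BamHI": "GGATCC", "SalI": "GTCGAC", "NcoI": "CCATGG"}


def count_site(sequence: str, site: str) -> int:
    """Count occurrences (overlaps included) of a recognition site in a sequence."""
    sequence = "".join(sequence.split()).upper()
    return sum(sequence.startswith(site, i)
               for i in range(len(sequence) - len(site) + 1))


def unique_sites(sequence: str) -> list:
    """Names of the enzymes whose recognition site occurs exactly once."""
    return [name for name, site in SITES.items() if count_site(sequence, site) == 1]


# Toy sequence for illustration only; it is not taken from any vector.
toy = "AAGGATCCTTGTCGACAAGTCGACTT"
print(unique_sites(toy))  # ['BamHI']
```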

Suitable cloning vectors include phage derivatives such as lambda gt11 (Young and 
Davis, Proc. Natl. Acad. Sci. USA, 80, 1194-1198 (1983)), the various phage M13-derived vectors 
such as M13mp9 (commercially available from Bethesda Research Labs), plasmids such as pBR322, 
and many others (Old and Primrose, Principles of Gene Manipulation, 2nd Ed., University of 
California Press, 32-35 and 46-47 (1981)). The streptococcal DNA is inserted into the cloning vector 
by such standard methods as homopolymeric tailing or by using linker molecules. 

In connection with the invention as described, the PCR method and other molecular 
biology and immunology methods and materials are used. Except for the method for synthesizing 
particular fragments in the pre-selected sequence and orientation, the PCR method and other 
materials used herein are described in several general standard texts and laboratory manuals. For 
instance, see Sambrook, Section 14, In Vitro Amplification of DNA by PCR; Ausubel, Current Protocols 
in Molecular Biology, Section 15; for protein expression see same, Section 16; for prokaryote and 
eukaryote expression vectors, see Sambrook, Section 1.7; Current Protocols in Molecular Biology, 
Section 1, e.g. E. coli Plasmids, listing numerous available plasmids. For other suitable vectors for 
molecular cloning, see Perbal (2nd Ed.), Section 6, which lists, for instance, cloning vectors derived 
from pBR322 (used herein). For material, protocols, etc. in immunology, see in general Current 
Protocols in Immunology; also see Section 7 for Immunologic Studies in Humans and Section 8 for 
Isolation and Analysis of Proteins. 
The ATCC Catalogue of Bacteria and Phages lists suitable microorganisms. For a catalogue of yeasts, 
see ATCC Catalogue of Yeasts (1990), 18th Ed. For available recombinant DNA materials (hosts, 
libraries, vectors, clones, etc.), see ATCC Catalogue of Recombinant DNA Materials, 2nd Ed. (1991). 

Another worthwhile publication is Immunology of Proteins, Atassi (vol. 3), Plenum 
Press (1979). 

This invention makes a significant contribution to the medical arts. It is contemplated 
to be within the scope of the invention that substantially the same results be obtained by 
substantially the same means operating or performing in substantially the same manner as described 
herein. 

One skilled in the art may refer to the below listed bibliography which is incorporated 
herein by reference. 

We claim: 

1. An immunogenic recombinant multivalent hybrid M protein which comprises 
amino acid fragments of streptococcal M protein that carry epitopes that elicit opsonic antibodies 
against multiple serotypes of Group A streptococci, said protein not eliciting tissue cross-reactive 
antibodies. 

2. The immunogenic recombinant multivalent hybrid M protein of claim 1, 
wherein the amino acid fragments are amino- terminal fragments of the streptococcal M protein. 

3. The immunogenic recombinant multivalent hybrid M protein of claim 2, which 
hybrid M protein also elicits mucosal antibodies. 

4. The immunogenic multivalent hybrid M protein of claim 2, wherein the amino 
acid fragments comprise amino- terminal portions of serotypes of potentially rheumatogenic 
streptococci serotypes. 

5. The immunogenic recombinant multivalent hybrid M protein of claim 4, 
wherein the serotypes are selected from the group of serotypes M1, M3, M12, M18, M19 and M24. 

6. The immunogenic recombinant multivalent hybrid M protein of claim 5, which 
is tetravalent. 

7. The immunogenic recombinant multivalent hybrid M protein of claim 5, which 
is trivalent. 

8. The immunogenic recombinant multivalent hybrid M protein of claim 7, 
wherein the amino acid fragments are selected from the group consisting of M24, M5 and M19. 

9. The immunogenic recombinant multivalent hybrid M protein of claim 6, 
wherein the amino acids are selected from the group consisting of M24, M5, M6 and M19. 

10. The immunogenic recombinant multivalent hybrid M protein of claim 6, 
wherein the M24 fragment is 113 amino acids long, the M5 fragment is 58 amino acids long, the M6 
fragment is 35 amino acids long and the M19 fragment is 35 amino acids long. 

11. The immunogenic recombinant multivalent hybrid M protein of claim 7, 
wherein the M24 fragment is 260 amino acids long, the M5 fragment is 11 amino acids long and the M6 
fragment is 12 amino acids long. 



12. The immunogenic recombinant multivalent hybrid M protein of claim 3, which 
comprises amino acids of the C-repeats of the M protein. 

13. The immunogenic recombinant multivalent hybrid M protein of claim 12, 
which comprises a carboxyl terminal fragment of an M protein. 

14. The immunogenic recombinant multivalent hybrid M protein of claim 1, 
wherein amino acid fragments are fused to each other by a linker comprising amino acids. 

15. The therapeutic recombinant multivalent hybrid M protein of claim 14, 
wherein the linkers of the amino acids are selected from the group consisting of Arg, Ser, Val, Asp, 
Pro, and Trp. 

16. An immunogenic composition which comprises a therapeutically acceptable 
carrier and the protein of claim 1. 

17. A method for immunizing a mammal against streptococci infections by eliciting 
opsonic antibodies to a streptococcal M protein without eliciting antibodies which are cross-reactive 
with heart tissue antigens, which comprises administering to said mammal, in an amount effective to 
confer immunity against group A streptococci infections, the therapeutic composition of claim 16. 

18. A hybrid recombinant DNA which comprises fused nucleotide sequences of 
streptococcal M protein which encode amino acid fragments of streptococcal M protein that carry 
epitopes that elicit opsonic antibodies against multiple serotypes of group A streptococci. 

19. The hybrid recombinant DNA of claim 18 which comprises also nucleotide 
sequences which encode C-terminal amino acids of the M protein which evoke mucosal antibodies. 

20. The hybrid recombinant DNA of claim 19, wherein the nucleotide sequence 
encodes amino acids of the C-terminal repeats of the M protein. 

Figure 1 

RECOMBINANT M24-M5-M6 

M24 ► 

10 20 30 40 50 60 

ATG GTC GCG ACT AGG TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GCT GAC AAG 
Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys 

70 80 90 100 110 120 

TTT GAG ATA GAA AAC AAT ACG TTA AAA CTT AAG AAT AGT GAC TTA AGT TTT AAT AAT AAA 
Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe Asn Asn Lys 

130 140 150 160 170 180 

GCG TTA AAA GAT CAT AAT GAT GAG TTA ACT GAA GAG TTG AGT AAT GCT AAA GAG AAA CTA 
Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu 

190 200 210 220 230 240 

CGT AAA AAT GAT AAA TCA CTA TCT GAA AAA GCT AGT AAA ATT CAA GAA TTA GAG GCA CGT 
Arg Lys Asn Asp Lys Ser Leu Ser Glu Lys Ala Ser Lys Ile Gln Glu Leu Glu Ala Arg 

250 260 270 280 290 300 

AAG GCT GAT CTT GAA AAA GCA TTA GAA GGC GCA ATG AAT TTT TCA ACA GCG GAT TCA GCT 
Lys Ala Asp Leu Glu Lys Ala Leu Glu Gly Ala Met Asn Phe Ser Thr Ala Asp Ser Ala 

310 320 330 340 350 360 

AAA ATC AAA ACC TTA GAA GCA GAG AAA GCT GCT TTA GCG GCA CGT AAG GCT GAT CTT GAA 
Lys Ile Lys Thr Leu Glu Ala Glu Lys Ala Ala Leu Ala Ala Arg Lys Ala Asp Leu Glu 

370 380 390 400 410 420 

AAA GCA TTA GAA GGC GCA ATG AAC TTT TCA ACA GCG GAT TCA GCT AAA ATC AAA ACC TTA 
Lys Ala Leu Glu Gly Ala Met Asn Phe Ser Thr Ala Asp Ser Ala Lys Ile Lys Thr Leu 

430 440 450 460 470 480 

GAA GCA GAG AAA GCT GCT TTA GAG GCA CGC CAG GCT GAA CTT GAA AAA GCA TTA GAA GGC 
Glu Ala Glu Lys Ala Ala Leu Glu Ala Arg Gln Ala Glu Leu Glu Lys Ala Leu Glu Gly 

490 500 510 520 530 540 

GCA ATG AAT TTT TCA ACA GCG GAT TCA GCT AAA ATC AAA ACC TTA GAA GCA GAG AAA GCT 
Ala Met Asn Phe Ser Thr Ala Asp Ser Ala Lys Ile Lys Thr Leu Glu Ala Glu Lys Ala 

550 560 570 580 590 600 

GCT TTA GCG GCA CGT AAG GCT GAT CTT GAA AAA GCA TTA GAA GGC GCA ATG AAC TTT TCA 
Ala Leu Ala Ala Arg Lys Ala Asp Leu Glu Lys Ala Leu Glu Gly Ala Met Asn Phe Ser 

610 620 630 640 650 660 

ACA GCG GAT TCA GCT AAA ATC AAA ACC TTA GAA GCA GAG AAA GCT GCT TTA GAG GCA CGC 
Thr Ala Asp Ser Ala Lys Ile Lys Thr Leu Glu Ala Glu Lys Ala Ala Leu Glu Ala Arg 

670 680 690 700 710 720 

CAG GCT GAA CTT GAA AAA GCA TTA GAA GGC GCA ATG AAT TTT TCA ACA GCG GAT TCA GCT 
Gln Ala Glu Leu Glu Lys Ala Leu Glu Gly Ala Met Asn Phe Ser Thr Ala Asp Ser Ala 

730 740 750 760 770 780 

AAA ATC AAA ACC TTA GAA GCA GAG AAA GCT GCT TTG GAG GCA GAG AAA GCT GAT CTT GAA 
Lys Ile Lys Thr Leu Glu Ala Glu Lys Ala Ala Leu Glu Ala Glu Lys Ala Asp Leu Glu 

BamHI SalI 

790 800 810 820 830 840 

GGA TCC GCC GTG ACT AGG GGT ACA ATA AAT GAC CCC CAA GTC GAC AGA GTC TTT CCT AGG 
Gly Ser Ala Val Thr Arg Gly Thr Ile Asn Asp Pro Gln Val Asp Arg Val Phe Pro Arg 

M5 ► M6 ► 

850 860 

GGG ACG GTA GAA AAC CCG GAC 
Gly Thr Val Glu Asn Pro Asp 

Figure 2 

IMMUNOBLOT ANALYSIS OF 
M24-M5-M6 HYBRID VACCINE 

Figure 4 

RECOMBINANT M24-M5-M6-M19 

M24 ► 

10 20 30 40 50 60 

ATG GTC GCG ACT AGG TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GCT GAC AAG 
Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys 

70 80 90 100 110 120 

TTT GAG ATA GAA AAC AAT ACG TTA AAA CTT AAG AAT AGT GAC TTA AGT TTT AAT AAT AAA 
Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe Asn Asn Lys 

130 140 150 160 170 180 

GCG TTA AAA GAT CAT AAT GAT GAG TTA ACT GAA GAG TTG AGT AAT GCT AAA GAG AAA CTA 
Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu 

190 200 210 220 230 240 

CGT AAA AAT GAT AAA TCA CTA TCT GAA AAA GCT AGT AAA AAT CAA GAA TTA GAG GCA CGT 
Arg Lys Asn Asp Lys Ser Leu Ser Glu Lys Ala Ser Lys Asn Gln Glu Leu Glu Ala Arg 

250 260 270 280 290 300 

AAG GCT GAT CTT GAA AAA GCA TTA GAA GGC GCA ATG AAT TTT TCA ACA GCG GAT TCA GCT 
Lys Ala Asp Leu Glu Lys Ala Leu Glu Gly Ala Met Asn Phe Ser Thr Ala Asp Ser Ala 

BamHI M5 ► 

310 320 330 340 350 360 

AAA ATC AAA ACC TTA GAA GCA GAG AAA GCT GAT CTT GAA GGA TCC GCC GTG ACT AGG GGT 
Lys Ile Lys Thr Leu Glu Ala Glu Lys Ala Asp Leu Glu Gly Ser Ala Val Thr Arg Gly 

370 380 390 400 410 420 

ACA ATA AAT GAC CCG CAA AGA GCA AAA GAA GCT CTT GAC AAG TAT GAG CTA GAA AAC CAT 
Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala Leu Asp Lys Tyr Glu Leu Glu Asn His 

430 440 450 460 470 480 

GAC TTA AAA ACT AAG AAT GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG AAT 
Asp Leu Lys Thr Lys Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr Glu Asn 

SalI M6 ► 

490 500 510 520 530 540 

GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG GTC GAC AGA GTC TTT CCT AGG 
Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr Glu Val Asp Arg Val Phe Pro Arg 

550 560 570 580 590 600 

GGG ACG GTA GAA AAC CCG GAC AAA GCA CGA GAA CTT CTT AAC AAG TAT GAC GTA GAG AAC 
Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Glu Leu Leu Asn Lys Tyr Asp Val Glu Asn 

NcoI M19 ► 

610 620 630 640 650 660 

TCT ATG TTA CAA GCT AAT AAT GAC AAC TTA CCA TGG AGA GTG CGT TAT ACT AGG CAT ACG 
Ser Met Leu Gln Ala Asn Asn Asp Asn Leu Pro Trp Arg Val Arg Tyr Thr Arg His Thr 

670 680 690 700 710 720 

CCA GAA GAT AAG CTA AAA AAA ATT ATT GAC GAT CTT GAC GCA AAA GAA CAT GAA TTA CAA 
Pro Glu Asp Lys Leu Lys Lys Ile Ile Asp Asp Leu Asp Ala Lys Glu His Glu Leu Gln 

730 740 

CAA CAG AAT GAG AAG TTA TCT 
Gln Gln Asn Glu Lys Leu Ser 

IMMUNOBLOT ANALYSIS OF 
M24-M5-M6-M19 HYBRID VACCINE 

Figure 6 

M24-M5-M6-M19 (LINKER VARIANT) Translated Sequence 

Sequence Range: 1 to 822 

10 20 30 40 50 60 

ATG GTC GCG ACT AGG TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GCT GAC AAG 
Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys> 

70 80 90 100 110 120 

TTT GAG ATA GAA AAC AAT ACG TTA AAA CTT AAG AAT AGT GAC TTA AGT TTT AAT AAT AAA 
Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe Asn Asn Lys> 

130 140 150 160 170 180 

GCG TTA AAA GAT CAT AAT GAT GAG TTA ACT GAA GAG TTG AGT AAT GCT AAA GAG AAA CTA 
Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu> 

190 200 210 220 230 240 

CGT AAA AAT GAT AAA TCA CTA TCT GAA AAA GCT AGT AAA AAT CAA GAA TTA GAG GCA CGT 
Arg Lys Asn Asp Lys Ser Leu Ser Glu Lys Ala Ser Lys Asn Gln Glu Leu Glu Ala Arg> 

250 260 270 280 290 300 

AAG GCT GAT CTT GAA AAA GCA TTA GAA GGC GCA ATG AAT TTT TCA ACA GCG GAT TCA GCT 
Lys Ala Asp Leu Glu Lys Ala Leu Glu Gly Ala Met Asn Phe Ser Thr Ala Asp Ser Ala> 

310 320 330 340 350 360 

AAA ATC AAA ACC TTA GAA GCA GAG AAA GCT GAT CTT GAA GGA TCC CCA GGA AAC CCA GCT 
Lys Ile Lys Thr Leu Glu Ala Glu Lys Ala Asp Leu Glu Gly Ser Pro Gly Asn Pro Ala> 

BamHI LINKER 

370 380 390 400 410 420 

GTT CCA GGA TCC GCC GTG ACT AGG GGT ACA ATA AAT GAC CCG CAA AGA GCA AAA GAA GCT 
Val Pro Gly Ser Ala Val Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala> 

430 440 450 460 470 480 

CTT GAC AAG TAT GAG CTA GAA AAC CAT GAC TTA AAA ACT AAG AAT GAA GGG TTA AAA ACT 
Leu Asp Lys Tyr Glu Leu Glu Asn His Asp Leu Lys Thr Lys Asn Glu Gly Leu Lys Thr> 

490 500 510 520 530 540 

GAG AAT GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA 
Glu Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys> 

550 560 570 580 590 600 

ACT GAG GTC GAC CCA GGA AAC CCA GCT GTT CCA GTC GAC AGA GTG TTT CCT AGG GGG ACG 
Thr Glu Val Asp Pro Gly Asn Pro Ala Val Pro Val Asp Arg Val Phe Pro Arg Gly Thr> 

610 620 630 640 650 660 

GTA GAA AAC CCG GAC AAA GCA CGA GAA CTT CTT AAC AAG TAT GAC GTA GAG AAC TCT ATG 
Val Glu Asn Pro Asp Lys Ala Arg Glu Leu Leu Asn Lys Tyr Asp Val Glu Asn Ser Met> 

670 680 690 700 710 720 

TTA CAA GCT AAT AAT GAC AAG TTA CCA TGG CCA GGA AAC CCA GCT GTT CCA CCA TGG AGA 
Leu Gln Ala Asn Asn Asp Lys Leu Pro Trp Pro Gly Asn Pro Ala Val Pro Pro Trp Arg> 

LINKER NcoI 

730 740 750 760 770 780 

GTG CGT TAT ACT AGG CAT ACG CCA GAA GAT AAG CTA AAA AAA ATT ATT GAC GAT CTT GAC 
Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys Leu Lys Lys Ile Ile Asp Asp Leu Asp> 

790 800 810 820 

GCA AAA GAA CAT GAA TTA CAA CAA CAG AAT GAG AAG TTA TCT 
Ala Lys Glu His Glu Leu Gln Gln Gln Asn Glu Lys Leu Ser> 

Figure 7 

M24-M5-M6-M19 (SUBUNIT VAR) Translated Sequence 

Sequence Range: 1 to 561 

10 20 30 40 50 60 

ATG GTC GCG ACT AGG TCT CAG ACA GAT ACT CTC GAA AAA GTA CAA GAA GTC GCG ACT AGG 
Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Val Ala Thr Arg> 

70 80 90 100 110 120 

TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA GTC GCG ACT AGG TCT CAG ACA GAT ACT 
Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Val Ala Thr Arg Ser Gln Thr Asp Thr> 

130 140 150 160 170 180 

CTG GAA AAA GTA CAA GAA GGA TCC GCC GTC ACT AGG GGT ACA ATA AAT GAC CCG CAA AGA 
Leu Glu Lys Val Gln Glu Gly Ser Ala Val Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg> 

190 200 210 220 230 240 

GCA AAA GAA GCC GTC ACT AGG GGT ACA ATA AAT GAC CCG CAA AGA GCA AAA GAA GCC GTC 
Ala Lys Glu Ala Val Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala Val> 

250 260 270 280 290 300 

ACT AGG GGT ACA ATA AAT GAC CCG CAA AGA GCA AAA GAA GTC GAC AGA GTG TTT CCT AGG 
Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Val Asp Arg Val Phe Pro Arg> 

SalI 

310 320 330 340 350 360 

GGG ACG GTA GAA AAC CCG GAC AAA GCA CGA AGA GTG TTT CCT AGG GGG ACG GTA GAA AAC 
Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Arg Val Phe Pro Arg Gly Thr Val Glu Asn> 

370 380 390 400 410 420 

CCG GAC AAA GCA CGA AGA GTG TTT CCT AGG GGG ACG GTA GAA AAC CCG GAC AAA GCA CGA 
Pro Asp Lys Ala Arg Arg Val Phe Pro Arg Gly Thr Val Glu Asn Pro Asp Lys Ala Arg> 

430 440 450 460 470 480 

CCA TGG AGA GTG CGT TAT ACT AGG CAT ACG CCA GAA GAT AAG CTA AAA AAA AGA GTG CGT 
Pro Trp Arg Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys Leu Lys Lys Arg Val Arg> 

490 500 510 520 530 540 

TAT ACT AGG CAT ACG CCA GAA GAT AAG CTA AAA AAA AGA GTG CGT TAT ACT AGG CAT ACG 
Tyr Thr Arg His Thr Pro Glu Asp Lys Leu Lys Lys Arg Val Arg Tyr Thr Arg His Thr> 

550 560 

CCA GAA GAT AAG CTA AAA AAA 
Pro Glu Asp Lys Leu Lys Lys> 

Figure 8 

M24-5-6-19-C-TERM Translated Sequence 

[Nucleotide sequence and translation, positions 1 to 780: not legible in the source.] 

M24-5-6-19-C-TERM Translated Sequence 

CAA AAC AAA GCA CCA ATG AAG GAA ACT AAG AGA CAG TTA CCA TCA ACA GGT GAA ACA GCT 
Gln Asn Lys Ala Pro Met Lys Glu Thr Lys Arg Gln Leu Pro Ser Thr Gly Glu Thr Ala> 

850 860 870 880 890 900 

AAC CCA TTC TTC ACA GCG GCA GCC CTT ACT GTT ATG GCA ACA GCT GGA GTA GCA GCA GTT 
Asn Pro Phe Phe Thr Ala Ala Ala Leu Thr Val Met Ala Thr Ala Gly Val Ala Ala Val> 

910 920 

GTA AAA CGC AAA GAA GAA AAT TAA 
Val Lys Arg Lys Glu Glu Asn ***> 
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Figure 9 
ML9-M6-MS-M24 TJEBAVALENT HfflPID 

10 20 30 40 50 60

ATG AGA GTG CGT TAT ACT AGG [illegible]
Met Arg Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys Leu Lys Lys Ile Ile Asp Asp>

70 80 90 100 110 120

[illegible]

130 140 150 160 170 180

[illegible]

190 200 210 220 230 240

[illegible] AAT AAT GAC AAC TTA GTC GAC GCC GTG ACT AGG GGT
[illegible]

250 260 270 280 290 300

[illegible]

310 320 330 340 350 360

[illegible] AAT GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG AAT

370 380 390 400 410 420

[illegible] TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG CCA TGG GTC GCG ACT AGG TCT
[illegible] Glu Gly Leu Lys Thr Glu [illegible]
430 440 450 460 470 480 

CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GCT GAC AAG TTT GAG ATA GAA AAC AAT

Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys Phe Glu Ile Glu Asn Asn>

490 500 510 520 530 540 

ACG TTA AAA CTT AAG AAT AGT GAC TTA AGT TTT AAT AAT AAA GCG TTA AAA GAT CAT AAT

Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe Asn Asn Lys Ala Leu Lys Asp His Asn>

550 560 570 580 590 600 

GAT GAG TTA ACT GAA GAG TTG AGT AAT GCT AAA GAG AAA CTA CGT AAA AAT GAT AAA TCA
Asp Glu Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu Arg Lys Asn Asp Lys Ser>

610 620 630 640 650 660

* * * * * *

CTA TCT GAA AAA GCT AGT AAA AAT CAA GAA TTA GAG GCA CGT AAG GCT GAT CTT GAA AAA

Leu Ser Glu Lys Ala Ser Lys Asn Gln Glu Leu Glu Ala Arg Lys Ala Asp Leu Glu Lys>

670 680 690 700 710 720

* * * * * *

GCA TTA GAA GGC GCA ATG AAT TTT TCA ACA GCG GAT [illegible]

Ala Leu Glu Gly Ala Met Asn Phe Ser Thr Ala Asp Ser Ala Lys Ile Lys Thr Leu Glu>

730 740

GCA GAG AAA GCT GAT CTT GAA
Ala Glu Lys Ala Asp Leu Glu>
Figure 10

Sequence Range: 1 to 522

10 20 30 40 50 60

* * * * * *

ATG GTC GCC ACT AGG TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GCT GAC AAG
Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys>

M24
70 80 90 100 110 120

* * * * * *

TTT GAG ATA GAA AAC AAT ACG TTA AAA CTT AAG AAT AGT GAC TTA AGT TTT AAT AAT AAA

Phe Glu Ile Glu Asn Asn Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe Asn Asn Lys>

130 140 150 160 170 180

* * * * * *

GCG TTA AAA GAT CAT AAT GAT GAG TTA ACT GAA GAG TTG AGT AAT GCT AAA GAG AAA CTA
Ala Leu Lys Asp His Asn Asp Glu Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu>

190 200 210 220 230 240

* * * * * *

CGT AAA AAT GAT AAA TCA CTA TCT GAA AAA GCT AGT AAA AAT CAA GAA TTA GAG GCA CGT

Arg Lys Asn Asp Lys Ser Leu Ser Glu Lys Ala Ser Lys Asn Gln Glu Leu Glu Ala Arg>

250 260 270 280 290 300 



* * * * * *

AAG GCT GAT CTT GAA AAA GCA TTA GAA GGC GCA ATG AAT TTT TCA ACA GCG GAT TCA GCT
Lys Ala Asp Leu Glu Lys Ala Leu Glu Gly Ala Met Asn Phe Ser Thr Ala Asp Ser Ala>

310 320 330 340 350 360 

* * * * * *

AAA ATC AAA ACC TTA GAA GCA GAG AAA GCT GAT CTT GAA GGA TCC GCC GTG ACT AGG GGT
Lys Ile Lys Thr Leu Glu Ala Glu Lys Ala Asp Leu Glu Gly Ser Ala Val Thr Arg Gly>

BamHI M5

370 380 390 400 410 420 

* * * * * *
ACA ATA AAT GAC CCG CAA AGA GCA AAA GAA GCT CTT GAC AAG TAT GAG CTA GAA AAC CAT
Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala Leu Asp Lys Tyr Glu Leu Glu Asn His>

430 440 450 460 470 480 

* * * * * *

GAC TTA AAA ACT AAG AAT GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG AAT 
Asp Leu Lys Thr Lys Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr Glu Asn> 

490 500 510 520 

* * * *

GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG TAA
Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr Glu ***>
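Each row of these figures pairs a line of codons with the amino acids they encode, so a printed row can be checked mechanically against the standard genetic code. The following is a minimal sketch of such a check and is not part of the original disclosure; the function names `translate` and `mismatches` are hypothetical, and the sketch assumes the standard code and codons separated by spaces as printed. The sample row is the row at positions 421 to 480 of Figure 10.

```python
from itertools import product

# Standard genetic code, listed for codons in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Three-letter names as used in the figures; "***" marks a stop codon.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}

CODON_TABLE = {
    "".join(codon): THREE_LETTER[aa]
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(codon_row):
    """Translate a space-separated row of codons into three-letter residues.

    Raises KeyError on any token that is not a valid DNA codon, which is a
    quick way to spot characters damaged in reproduction.
    """
    return " ".join(CODON_TABLE[c] for c in codon_row.upper().split())


def mismatches(codon_row, printed_row):
    """Return (position, codon, printed, expected) for residues that disagree."""
    expected = translate(codon_row).split()
    printed = printed_row.strip().rstrip(">").split()
    return [
        (i + 1, codon, shown, exp)
        for i, (codon, shown, exp) in enumerate(
            zip(codon_row.upper().split(), printed, expected)
        )
        if shown != exp
    ]


row = ("GAC TTA AAA ACT AAG AAT GAA GGG TTA AAA ACT GAG AAT "
       "GAA GGG TTA AAA ACT GAG AAT")
print(translate(row))
```

A row whose codon line and translation line disagree, for example a codon printed as ACT under a residue printed as Ser, shows up in the list returned by `mismatches`, which helps separate reproduction errors from genuine sequence differences.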
Figure 11

TETRAVALENT-C REPEAT Translated Sequence

Sequence Range: 1 to 1029

10 20 30 40 50 60 

* * * * * *

ATG AGA GTC CGT TAT ACT AGG CAT ACG CCA GAA GAT AAG CTA AAA AAA ATT ATT GAC GAT

Met Arg Val Arg Tyr Thr Arg His Thr Pro Glu Asp Lys Leu Lys Lys Ile Ile Asp Asp>



70 80 90 100 110 120

* * * * * *

CTT GAC GCA AAA GAA CAT GAA TTA CAA CAA CAG [illegible] TTA [illegible]
Leu Asp Ala Lys Glu His Glu Leu Gln Gln Gln Asn Glu Lys Leu Ser Gly Ser Arg Val>

130 140 150 160 170 180 

* * * * * *

TTT CCT AGG GGC ACG GTA GAA AAC CCG GAC AAA GCA CGA GAA CTT CTT AAC AAG TAT GAC
Phe Pro Arg Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Glu Leu Leu Asn Lys Tyr Asp>

190 200 210 220 230 240 

* * * * * *

GTA GAG AAC TCT ATG TTA CAA GCT AAT AAT GAC AAC TTA GTC GAC GCC GTG ACT AGG GGT
Val Glu Asn Ser Met Leu Gln Ala Asn Asn Asp Asn Leu Val Asp Ala Val Thr Arg Gly>

250 260 270 280 290 300 

* * * * * *

ACA ATA AAT GAC CCG CAA AGA GCA AAA GAA GCT CTT GAC AAG TAT GAG CTA GAA AAC CAT
Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Ala Leu Asp Lys Tyr Glu Leu Glu Asn His>

310 320 330 340 350 360

* * * * * *

GAC TTA AAA ACT AAG AAT GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG AAT
Asp Leu Lys Thr Lys Asn Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr Glu Asn>

370 380 390 400 410 420 

* * * * * *

GAA GGG TTA AAA ACT GAG AAT GAA GGG TTA AAA ACT GAG CCA TGG GTC GCG ACT AGG TCT
Glu Gly Leu Lys Thr Glu Asn Glu Gly Leu Lys Thr Glu Pro Trp Val Ala Thr Arg Ser>

430 440 450 460 470 480

* * * * * *

CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA CGT GCT GAC AAG TTT GAG ATA GAA AAC AAT
Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Arg Ala Asp Lys Phe Glu Ile Glu Asn Asn>

490 500 510 520 530 540

* * * * * *

ACG TTA AAA CTT AAG AAT AGT GAC TTA AGT TTT AAT AAT AAA GCG TTA AAA GAT CAT AAT
Thr Leu Lys Leu Lys Asn Ser Asp Leu Ser Phe Asn Asn Lys Ala Leu Lys Asp His Asn>

550 560 570 580 590 600 

* * * * * *

GAT GAG TTA ACT GAA GAG TTG AGT AAT GCT AAA GAG AAA CTA CGT AAA AAT GAT AAA TCA
Asp Glu Leu Thr Glu Glu Leu Ser Asn Ala Lys Glu Lys Leu Arg Lys Asn Asp Lys Ser>

610 620 630 640 650 660 

* * * * * *

CTA TCT GAA AAA GCT AGT AAA AAT CAA GAA TTA GAG GCA CGT AAG GCT GAT CTT GAA AAA
Leu Ser Glu Lys Ala Ser Lys Asn Gln Glu Leu Glu Ala Arg Lys Ala Asp Leu Glu Lys>

670 680 690 700 710 720

* * * * * *

GCA TTA GAA GGC GCA ATG AAT TTT TCA ACA GCG GAT TCA GCT AAA ATC AAA ACC TTA GAA
Ala Leu Glu Gly Ala Met Asn Phe Ser Thr Ala Asp Ser Ala Lys Ile Lys Thr Leu Glu>

730 740 750 760 770 780 

* * * * * *

GCA GAG AAA GCT GAT CTT GAA CGA TCG AAC AAA ATT TCA GAC GCA AGC CGT AAG GGT CTT

Ala Glu Lys Ala Asp Leu Glu Arg Ser Asn Lys Ile Ser Asp Ala Ser Arg Lys Gly Leu>

Asn Lys Ile Ser Asp Ala Ser Arg Lys Gly Leu>
MC a T£j0>! Qgf*&)T / 

790 800 810 820 830 840

* * * * * *

CGT CGT GAC TTA GAC GCA TCG CGT GAA GCT AAG AAG CAA TTA GAA GCT GAA CAC CAA AAA
Arg Arg Asp Leu Asp Ala Ser Arg Glu Ala Lys Lys Gln Leu Glu Ala Glu His Gln Lys>
Arg Arg Asp Leu Asp Ala Ser Arg Glu Ala Lys Lys Gln Leu Glu Ala Glu His Gln Lys>

850 860 870 880 890 900

* * * * * *
CTT GAA GAA CAA AAC AAG ATT TCA GAA GCA AGT CGC AAA GGC CTT CGC CGT GAT TTA GAC
Leu Glu Glu Gln Asn Lys Ile Ser Glu Ala Ser Arg Lys Gly Leu Arg Arg Asp Leu Asp>
Leu Glu Glu Gln Asn Lys Ile Ser Glu Ala Ser Arg Lys Gly Leu Arg Arg Asp Leu Asp>
TETRAVALENT-C REPEAT Translated Sequence



910 920 930 940 950 960

GCA TCA CGT GAA GCT AAG AAG CAA TTA GAA GCT GAA CAA CAA AAA CTT GAA GAA CAA AAC
Ala Ser Arg Glu Ala Lys Lys Gln Leu Glu Ala Glu Gln Gln Lys Leu Glu Glu Gln Asn>
Ala Ser Arg Glu Ala Lys Lys Gln Leu Glu Ala Glu Gln Gln Lys Leu Glu Glu Gln Asn>

970 980 990 1000 1010 1020

AAG ATT TCA GAA GCA AGT [illegible] AAA GGC CTT CGC CGT GAT TTA GAC GCA TCA CGT GAA GCT
Lys Ile Ser Glu Ala Ser Arg Lys Gly Leu Arg Arg Asp Leu Asp Ala Ser Arg Glu Ala>
Lys Ile Ser Glu Ala Ser Arg Lys Gly Leu Arg Arg Asp Leu Asp Ala Ser Arg Glu Ala>

AAG AAA CAA 
Lys Lys Gln> 
Lys Lys Gln>
Figure 12

M24-M5-M6-M19 MULTIVALENT HYBRID M PROTEIN WITH SHORT SUBUNITS (15 AA)

10 20 30 40 50 60

Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu [illegible] Val>
M24

70 80 90 100 110 120

* * * * * *

ACT AGG GGT ACA ATA AAT GAC CCG CAA AGA GCA AAA GAA GTC GAC AGA GTG TTT CCT AGG
Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Val Asp Arg Val Phe Pro Arg>

SalI M6

130 140 150 160 170 180 

* * * * * * 

GGG ACG GTA GAA AAC CCG GAC AAA GCA CGA CCA TGG AGA GTG CGT TAT ACT AGG CAT ACG
Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Pro Trp Arg Val Arg Tyr Thr Arg His Thr>

NcoI M19

190 200
* *

CCA GAA GAT AAG CTA AAA AAA TAA
Pro Glu Asp Lys Leu Lys Lys ***>
Figure 13

M24-M5-M6-M19-M3-M1-M18-M12 MULTIVALENT HYBRID M PROTEIN

10 20 30 40 50 60 

* * * * * *

ATG GTC GCG ACT AGG TCT CAG ACA GAT ACT CTG GAA AAA GTA CAA GAA GGA TCC GCC GTG
Met Val Ala Thr Arg Ser Gln Thr Asp Thr Leu Glu Lys Val Gln Glu Gly Ser Ala Val>
M24 BamHI M5

70 80 90 100 110 120

* * * * * *

ACT AGG GGT ACA ATA AAT GAC CCG CAA AGA GCA AAA GAA GTC GAC AGA GTG TTT CCT AGG
Thr Arg Gly Thr Ile Asn Asp Pro Gln Arg Ala Lys Glu Val Asp Arg Val Phe Pro Arg>

SalI M6

130 140 150 160 170 180 

* * * * * *

GGG ACG GTA GAA AAC CCG GAC AAA GCA CGA CCA TGG AGA GTG CGT TAT ACT AGG CAT ACG
Gly Thr Val Glu Asn Pro Asp Lys Ala Arg Pro Trp Arg Val Arg Tyr Thr Arg His Thr>

NcoI M19

190 200 210 220 230 240 

CCA GAA GAT AAG CTA AAA AAA CTG CAG GAT GCT AGG AGT GTT AAT GGA GAG TTT CCT [illegible]
Pro Glu Asp Lys Leu Lys Lys Leu Gln Asp Ala Arg Ser Val Asn Gly Glu Phe Pro Arg>

PstI M3

250 260 270 280 290 300 

* * * * * * 

CAT GTT AAA TTA ATC GAT AAC GGT GAT GGT AAT CCT AGG GAA GTT ATA GAA GAT CTT GCA
His Val Lys Leu Ile Asp Asn Gly Asp Gly Asn Pro Arg Glu Val Ile Glu Asp Leu Ala>
ClaI M1

310 320 330 340 350 360 

* * * * * *

GCA GAA TTC GCA CCT CTT ACT CGA GCT ACA GCA GAC AAT AAA GAC GAA TTA ATA CGA TCG
Ala Glu Phe Ala Pro Leu Thr Arg Ala Thr Ala Asp Asn Lys Asp Glu Leu Ile Arg Ser>

EcoRI M18 PvuI

370 380 390 400 

* * * *

CAT AGT GAT TTA GTC GCA GAA AAA CAA CGT TTA GAA GAT TTA GGA TAA
His Ser Asp Leu Val Ala Glu Lys Gln Arg Leu Glu Asp Leu Gly ***>
M12